This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the appHcant. 

Defects in the images may include (but are not Hmited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY, 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



PCT 



WORLD INTELLECnjAL PROPERTY ORGANIZATION 
IntematioiuJ Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 
(51). International Patent Classification ^ : 



C12P 19/34, C12Q 1/68, C12N 9A2 



Al 



(11) InternaUona) Publication Number: WO 98/42860 

(43) International Publication Date: I October 1998 (01.10.98) 



(21) International Application Number: PCr/US98/0S497 

(22) International Filing Date: 20 March 1998 (20.03.98) 



(30) Priority Data: 

08/822.774 
08/957.709 



21 March 1997 (21.03.97) US 
24 October 1997 (24.10.97) US 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Application 

US 08/957,709 (CIP) 

Filed on 24 O:tober 1997 (24.10.97) 



(71) Applicant ijor all designated States except US): STRATA- 

GENE tUS/US]; Suite 250. 12526 High Bluff Drive, San 
Diego, CA 92130 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only); HOGREFE. Holly 
[US/US]; 6045 Diiac Street, San Diego, CA 92122 (US). 
HANSEN, Connie. J. [US/US]; 3645 Gaylc Street, San 
Diego. CA 92115 (US). 



(74) Agents: GARRETT, Arthur, S. et al.; Finnegan, Henderson, 
Farabow. Ganttt & Dunncr, L.L.P., 1300 I Street, N. W., 
Washington, DC 20005-3315 (US). 



(81) Designated States: CA, JP, US, European patent (AT, BE, CH. 
DE. DK, ES, FI, FR, GB, GR, IE. IT, LU, MC. NL. PT. 
SE). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) TlUe: POLYMERASE ENHANCING FACTOR (PEF) EXTRACTS, PEF PROTEIN COMPLEXES. ISOLATED PEF PROTEIN, 
AND METHODS FOR PURIFYING AND IDENTIFYING 



(57) Abstract 



The invention provides novel extracts, proteins, and complexes that improve the polymerization activity of nucleic acid polymerases. 
Included within the aspects of the invention are methods for identifying compositions with a polymerase enhancing activity, methods for 
purifying and using these compositions, and specific extracts, proteins, and complexes that function to enhance polymerase activity. As an 
example, specifically described is nucleotide and amino acid sequence infomiation for a Pyrococcus furiousus PEF (P45). which was used 
to produce a recombinant PEF. 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States paity to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


US 


Lesotho 


SI 


Stovenia 


AM 


Armenii 


Fl 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Aoitria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


OA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


" Bosnia and HenegovinA 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Buricina Faso 


GR 


Gieece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Meiioo 


UZ 


Uzbekistan 


CP 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlandi 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyigyxttan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cfite d'lvoiic 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Repobltc of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazalcstan 


R 


Romania 






CZ 


Czech Republic 


LC 


Sabu Lucia 


RU 


Russian Federation 






DE 


Geimafly 


U 


IriffclHennein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Swedes 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







wo 98/42860 



PCT/US98/05497 



.1- 

Polymerase Enhancing Factor (PEF) Extracts, PEF 
Protein Complexes, Isolated PEF Protein , and 
Methods for Purifying and Identifying 

Related Application Information 

This application claims priority to and is a continuation-in-part of U.S. Patent 
application serial no. 08/957,709. filed October 24. 1997. which is a continuation-in- 
part of U.S. Patent application serial no. 08/822,774. filed March 21. 1997. The 
entire contents of those applications are specifically incorporated herein by 
reference and may be relied on to make and use embodiments of the claimed 
invention. 

Background of the Invention 

Field of the Invention 

The invention relates to the fields of nucleic acid polymerases and nucleic 
acid polymerization reactions. 
Introduction 

The efficiency of a nucleic acid polymerization reaction has implications for 
numerous assays and techniques. For example, the ability to enhance polymerase 
activity in a PGR process increases the sensitivity of the PCR-based assay. We 
have identified, produced, purified, and analyzed novel extracts, proteins, and 
complexes that improve the polymerization activity of nucleic acid polymerases. 
Included within the aspects of the present invention are methods for identifying 
compositions with a polymerase enhancing activity, methods for purifying and using 
these compositions, and specific extracts, proteins, and complexes that function to 
enhance polymerase activity. 
Description of Related Art 

Manipulating nucleic acids with polymerization reactions is a fundamental 
component of biotechnology-related research. These reactions pennit researchers 
to replicate DNA or RNA in vitro, which in turn allows cloning or amplification of 
specific nucleic adds or groups of nucleic acids. Numerous other examples exist 
detailing the critical nature of a nucleic acid polymerization reaction or a nucleic acid 
polymerization enzyme in a particular technique, including sequencing nucleic 
acids, mutagenesis of nucleic acid sequences, and producing nucleic add probes 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCT/US98/05497 



-2- 

for hybridization. Of particular current interest are amplification reactions, such as 
PGR, that have greatly increased the rate at which researchers can perform nucleic 
acid related experimentation. Extremely rare nucleic acids can now be amplified 
and manipulated using these techniques, which necessarily involve nucleic acid 
polymerases. 

Using techniques with an amplification step has driven concern for the 
efficiency, fidelity, and sensitivity of the polymerase used. This has resulted in 
efforts to both analyze and optimize polymerization conditions for a variety of 
applications. (Lundberg et al., Gene 108: 1-6 (1991); Eckert and Kunkel, PGR 
Methods Applic. 1: 17-24 (1991); Ung et al.. PGR Methods Applic. 1: 63-69 (1991); 
Brail et al., Mutat. Res. 303: 75-82 (1994); Canity and Wold, P,N,A.S. 89: 1021- 
1025 (1992); Taylor and Logan, Gun-. Opin. Biotechnol. 6: 24-29 (1995)) In 
particular, quantitative amplification-based reactions rely upon the ability to 
efficiently amplify each nucleic acid species present in a sample. (See Ausubel, et 
al., Ghapter 15, In: Gurrent Protocols in Molecular Biology, John Wiley & Sons 
(1992) and supplements through 1995.) Thus, both a concem for the accuracy of 
and a need for new methods to enhance the performance of amplification-based 
nucleic acid techniques exists in the art 

One way in which these concerns and needs have been addressed is 
through the use of additives to the amplification reaction. Different additives act at 
different points in the amplification process. For example, fonnamide has been 
used to increase the specificity of PGR with GG rich target sequences, which are 
particulariy susceptible to intramolecular hybridization that may prevent 
hybridization with a primer. (Saricar, G. et al. Nucl. Acids Res. 18: 7465 (1990)). It 
has also been reported that tetramethylammonium chloride increases yield and 
specificity of PGR reactions, (Ghevet, E., et. al.. Nucleic Acids Res. 23:3343-3334 
(1995).) Hung et al. report the reduction in multiple satellite bands from amplifying 
complex DNA when dimethyl sulfoxide (DMSO) is added. (Hung, T., et al. Nucl. 
Acids Res, 18: 4953(1990).) The multiple satellite bands often present problems in 
purifying the desired amplification product from the other DNA present. 

Gertain proteins have been used to stabilize hybridized nucleic acids during 
replication. For example, E. coli single-stranded DNA binding protein has been 
used to increase the yield and specificity of primer extension reactions and PGR 
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reactions. (U. S. Patent Nos. 5.449,603 and 5.534.407.) The gene 32 protein 
(single stranded DNA binding protein) of phage T4 apparently improves the ability 
to amplify larger DNA fragments (Schwartz, et a!., NucL Acids Res. 18: 1079 
(1990)) and enhances DNA polymerase fidelity (Huang, DNA Cell. Biol. 15: 589-594 
(1996)). In addition, bacterial thioredoxin combined with T7 DNA polymerase 
(Sequenase™; Amersham-USB) has been used to increase processivity, but the 
combination is not active at high temperatures, such as those used in PCR. 

Another way amplification-based assays and techniques have been 
improved is through the development of modified polymerases or the use of 
combinations of polymerases. (U.S. Patent 5,566,772) For example, the TaKaRa 
long PCR kit employs two polymerases (Takara Shuzo Co., Ltd; Japan), and a 
number of polymerase combinations were also tested by Barnes (Proc. Nat. Acad. 
Sci. USA, 91:2216-2220 (1994). Truncated Taq and T. flavus DNA polymerase 
enzymes that apparently exhibit increased thermostability and fidelity in PCR have 
also been suggested. (U.S. Patent 5,436,149.) Combinations of polymerases with 
and without 5* - 3' exonuclease or 3' - 5* proofreading activity have also been used. 
(U.S. Patent No. 5,489.523) 

Further, amplification-based assays and techniques have been improved 
through empirical testing of conditions, reagents, and reagent concentrations to 
optimize polymerization reactions with a particular enzyme. Temperature and 
length of amplification cycles, primer length, and pH, for example, are ail conditions 
that can be optimized. (Barnes, Proc. Nat. Acad. Sci. USA. 91:2216-2220 (1994).) 

However, accessory proteins can be even more useful in improving 
polymerase activity and/or the processivity of polymerases. "Processivity" in this 
context refers to the number of enzymatic reactions occumng each time an enzyme 
binds to its substrate. In the context of nucleic acid replication reactions, 
"processivity" means the number of bases that can be replicated when the 
polymerase binds to a priming site. An increase in processivity directly relates to 
longer replication products. 

Intracellular replication has been shown to involve accessory proteins, as 
characterized in E. coli, human, and phage T4 systems. The accessory proteins 
interact with polymeras s to improv activity and provide the high processivity 
necessary to replicate genomic DNA efficiently while avoiding unacceptable 
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mutation rates. Sine the accessory proteins can be used in combination with the 
other improvements noted above, the development and application of accessory 
proteins holds particular promise for enhancing the results of nucleic acid 
replication-based reactions. 

Accessory proteins have been identified in eukaryotes. E. coli. and 
bacteriophage-T4 and are thought to form "sliding clamp" structures. (Kelman and 
O'Donnell. Nucl. Adds. Res. 23(18): 3613-3620 (1995).) These structures are 
thought to tether the polymerase to DNA. thereby increasing processivity. The 
sliding clamp structures, however, have largely been studied in in vitro model 
systems. Only in the case of T4 polymerase has knowledge of the activity of such 
accessory proteins been used to improve polymerization-based techniques 
employed by researchers in the art. For example, accessory proteins of the T4 
holoenzyme have been reported to improve processivity when added to 
polymerization systems using T4 polymerase. (Young et al.. Biochem. 31(37): 
8675-8690 (1992); Oncor Fidelity^ Sequencing System. Oncor; Gaithersburg. 
MD.) However, since the T4 accessory proteins are derived from bacteriophage, 
they are not likely to enhance polymerases from bacteria, archae. or eukaryotes. 
Thus, the use of T4 accessory proteins is believed to have been limited to 
techniques where T4 polymerase is used. 

The presence of dUTP (deoxyuracil triphosphate) in a polymerization 
reaction and the effect of deoxyuridine-containing DNA on DNA synthesis have also 
been examined. In particular, deoxyuridine in a DNA strand has been shown to 
inhibit polymerization by archael DNA polymerases. (Lasken. et a/.. (1996) J. Biol. 
Chem. 271; 17692-17696.) While Lasken ef al. reported that archeal DNA 
polymerases, such as Vent, are inhibited by DNA containing deoxyuridine, they do 
not discuss the effect of removing uradl-containing nudeosides or nudeoside 
triphosphates from the readion to prevent incorporation. Furthermore, they do not 
discuss any enzyme that acts on or turns over dUTP in a reaction. Neither do they 
mention any dUTPase adivity or the possible effed of dUTPase activity on 
polymerization reactions. In addition. Lasken et al. do not appredate the fad that 
dUTP is generated during the course of a normal PCR reaction by the deamination 
of dCTP. As a result of the deamination. dUTP will be present and be incorporated 
into an amplified nudeic acid, inhibiting the polymerase adivity. Thus, the art has 
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not appreciated the potential of dUTPase activities and proteins in enhancing 
replication reactions. 

Accordingly, since present knowledge and use of accessory proteins has led 
to limited applications in replication-based techniques, there continues to exist a 
need in the art for new and more widely useful compositions for enhancing 
polymerase enzyme activity. The present invention meets this need. 

Summary flfth^ InVftfltlftn 
The present invention comprises extracts, protein complexes, and related 
proteins that possess nucleic acid polymerase enhancing activity useful in a variety 
of replication reactions known in the art. Thus, the extracts, protein complexes, and 
related proteins of the invention function to enhance a wide spectrum of in vitro 
nucleic acid replication reactions by providing, inter alia, replication products of 
superior length, fidelity or both, and at higher yields. As used in this specification 
and appended claims "polymerase enhancing activity" means the ability to increase 
the rate, fidelity, and/or yield of a nucleic acid polymerization reaction mediated by a 
nucleic add polymerase, or to expand or alter the range of conditions under which 
such reaction does or may proceed. 

In one aspect of the invention, extracts of Pyrococcus furiosus (Pfu) cells 
are provided that enhance the activity of Pfu DNA polymerase. The extracts 
enhance nucleic acid replication product yields over a fairly broad range of 
concentrations and contain at least one polymerase enhancing factor. As used in 
this specification and in the appended claims, the term "PEP includes purified 
naturally occurring polymerase enhancing factors and wholly or partially synthetic 
copies or active analogs thereof. In accordance with the invention, such extracts 
can be further purified by heparin affinity chromatography followed by sepharose 
gel purification. Additionally. PEFs can be identified and purified using the 
antibodies of this invention, discussed below. While Pfu cell samples were used 
and are specifically exemplified below, one skilled in the art will appreciate that 
other cell samples can be used to identify and purify PEFs. For example, other 
species of the archae Pyrococcus or Thermococcus can be used as well as 
thermophilic bacteria cells and other bacteria cells. In addition, eukaryotic cells and 
tissues can be used as a source for PEF, as demonstrated by the cloning and 
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expression of human dUTPase. which also enhances polymerase activity. Thus, 
the invention also comprises compositions and methods wherein a dUTPase or any 
activity that tums^jver dUTP is capable of acting to enhance a nucleic acid 
polymerization reaction. 

In another aspect of the invention. PEF complexes are provided. The PEF 
complexes of the invention possess polymerase enhancing activity and generally 
comprise multiple protein subunits with a combined molecular weight of 
approximately 250kD or above as determined by SDS-PAGE analysis and gel 
filtration of unheated PEF samples. An example of one PEF complex (P300) was 
purified from Pfu cell sample extracts. The predominant components of the 
complex are a 50kD protein (P50) and a 45kD protein (P45). Heat treating the Pfu 
P45 with 2% SDS and 1% TCA produces a 17-18kD protein, which represents the 
fully denatured form. However, the Pfu PEF complex contains other minor 
components with approximate apparent molecular weights of 150, 100. 85. 60. 55. 
42. and 37kD. At least two components (150 and 100) have been shown to be 
dimeric or polymeric forms of P50. Thus, the PEF complexes of the invention 
comprise protein components and function to enhance the activity of polymerases. 

In another aspect of the invention. Pfu proteins possessing polymerase 
enhancing activity are provided. These proteins have molecular weights between 
approximately 42 and 60kD by SDS PAGE analysis under partially denaturing 
conditions. The 42-60kD proteins may be used alone or in combination to enhance 
polymerase activity. Methods for purifying these proteins as well as the PEF 
extracts and PEF complexes from which they have been isolated are also provided. 

The invention also involves two particular proteins, Pfu P50 and P45. which 
are predominant components of the PEF complex (P300). Detailed structural and 
functional infonnation on the Pfu P45 and P50 proteins is disclosed. The P50 
protein is similar in structure to a bacterial flavoprotein. The P45 protein is similar in 
structure to dCTP deaminase, functions as a dUTPase. and possesses polymerase 
enhancing activity. The structural information herein can be used to generate 
specific hybridization probes that detect the presence of nucleic acids encoding a 
protein that is part of a PEF complex, or related proteins from samples from other 
species, or possesses PEF activity. Furthermore, the structural information can be 
used to generate proteins from expression systems known in the art, synthetic 
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proteins, partially synthetic proteins, or proteins made from a combination of natural 
proteins, expressed proteins, and synthetic proteins. Methods for detecting the 
presence or absence of polymerase enhancing activity and/or dUTPase activity are 
also included in this invention and can be used to identify the various active PEF 
proteins or analogs. In addition, polyclonal or monoclonal antibodies that bind to 
PEF components can be produced, for example from purified P45 or P50. purified 
PEF complexes (P300). or another PEF of the invention. These antibodies can 
then be employed in assays and kits, well known in the art. in order to identify the 
presence or absence of a PEF. 

The understanding of the catalytic activity of PEF. and the P45 protein in 
particular, provides aspects of this invention directed to polymerase enhancing 
proteins, as well as methods, kits, and compositions containing a dUTPase activity 
or dUTPase protein as a PEF. Thus, a dUTPase activity or dUTPase protein or 
composition can be used to enhance nucleic acid replication, polymerization, or 
PCR reactions according to this invention. In fact, any activity that functions to turn- 
over dUTP can be used as a polymerase enhancing activity of this invention. Wide- 
ranging sources for the dUTPase activity, protein, or composition exist, as it is 
demonstrated to be present from both archael and human sources, the ends of the 
phylogenetic possibilities. Thus, any cell or species can be used as a source for 
polymerase enhancing activity or PEF. 

Kits for replicating nucleic acids and methods for using the PEF complexes, 
specific proteins of the complexes, and extracts containing PEF are also provided. 
In addition, the complexes, proteins, and extracts can be used in compositions 
comprising a polymerase. Ideally, the polymerase will be one that is enhanced by 
the complex, protein, or PEF. The PEF extracts, complexes and proteins of the 
present invention are particulariy useful in mixtures with nucleic acid polymerases, 
such as native polymerases, those produced by recombinant DNA techniques, and 
kits containing such polymerases. 

Also provided in the invention are methods for identifying proteins or 
complexes that influence nucleic acid polymerases. The source of the protein can 
be any bacterial, archael. or eukaryotic species. Certain embodiments involve 
methods for identifying proteins affecting polymerases used in amplification 
reactions, for example, alpha-type DNA polymerases such as DNA polymerases 
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from Pyrvcoccus and Thermophilis species. Other embodiments involve the 
analysis of dUTPase activity as well as computer implemented screening methods 
to identify a PEF. 

Deiserintiffn fff the DraMfinflB 
Figure 1. PGR enhancing activity in a heparin sepharose column fraction. 
PGR enhancing activity was measured using the 6.2 kb system described in 
example 1. Golumn fraction SGS #36 H.S. #78 (prep. 2) was diluted in 1 x cloned 
Pfu PGR buffer and Ipl aliquots of the following were added to lOOpI PCRs. Figure 
1: lane 1, buffen lanes 2.4 column fraction diluted 1:10; lanes 3.5 column fraction 
diluted 1:100. In Figure 1. lanes 4 and 5. ^AA742 template DNA has been omitted 
from the reactions. 

Figure 2. PGR enhancing activity in a heparin sepharose column fraction. 
Figure 2: lane 1, buffer lanes 2,5 column fraction undiluted; lanes 3,6 column 
fraction diluted 1:10; lanes 4.7 column fraction diluted 1:100. In Figure 2. lanes 5- 
7, cloned Pfu DNA polymerase has been omitted from the PGRs. 

Figure 3. PGR enhancing activity of SDS-PAGE gel-purified samples from 
heparin sepharose fraction SGS #36 H.S. #78 (prep. 2). Figure 3 shows the PGR 
enhancing activity of proteins eluted from SDS-PAGE gel slices 1-7 (native Pfu 
DNA pol.; lanes 1-7) and 9-20 (H.S. #78; lanes 9-20 from Figure 4). The proteins 
were eluted as described in Example 2. One (1) pi of each gel slice, diluted 1:100 
in cloned Pfu PGR buffer, was added to cloned Pfu PCRs as described in Example 
1 (6.2kb primer-template system). In the left lanes of the gel is shown PGR product 
synthesis in the presence of Ipl of buffer (-) or H.S. #78, diluted 1:1000 (.OOlpi). 
1:10,000 (.OOOlMl). or 1:100.000 (.OOOOImI). DNA mari<ers were electrophoresed in 
lane 'm". 

Figure 4. Figure 4 shows a duplicate of the master SDS-PAGE gel from 
which gel slices were excised. The following proteins were electrophoresed on both 
gels: lane A. 8 pi of native Pfu DNA polymerase (lot #24); lane B, pre-stained 
molecular weight markers (Novex); lane G. heparin sepharose fraction SGS #36 
H.S. #78 4mI (»160 ng PEF). The samples were not pre-heated before loading, and 
the duplicate gel shown here was silver-stained. Gel slices 1-7 were recovered 
from lane A on the master gel. whil slices 8-25 were recovered from lane G of the 
master gel. 
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Flgure 5. DNA polymerase activity in SDS-PAGE gel purified samples. The 
level of DNA polymerase activity (cpms incorporated) in gel slice eluates dpi) was 
measured as described in Example 14. The polymerase activity exhibited by gel- 
purified proteins numbered 1-7 in the native Pfu DNA polymerase preparation 
(Figure 4; lane A) is shown by the solid line (left Y axis). The polymerase activity of 
gel-purified proteins numbered 8-25 In fraction H.S. #78 (Figure 4. lane C) is shown 
with the broken line (right-handed Y axis). The apparent molecular weights of the 
proteins tested are shown on the x axis (at the top) and are inferred from the 
position the gel slices were recovered, relative to pre-stained molecular weight 
markers. Gel slices #1 1 and 18 exhibited the highest PGR enhancing activity. 

Figure 6. SDS-PAGE analysis of gel-purified P. fuhosus PEF. The following 
samples were electrophoresed on 4-20% acrylamide gels in the presence of 1% 
SDS: lanes 2.3- SCS #37 H.S. fraction #75 (prep. 4; lOpI of a Microcon 30 
concentrated fraction: =100 ng PEF) lanes 4,5 lOjjl of SDS-PAGE gel slice eluate 
recovered from SCS #37 H.S. fraction #75 (migrated 0-4mm above 250kD protein 
marker). The samples in lanes 3 and 5 were boiled for 3 minutes before loading 
onto the SDS-PAGE gel. The completed gel was stained with silver stain. The 
migration of pre-stained molecular weight markers is shown in lane 1 . 

Figure 7. SDS-PAGE analysis of S200-purified P. furiosus PEF. lOpI (2.25 
Mg) of S200-purified PEF (prep. 1) was incubated in the presence of 2% SDS for 3 
minutes at room temperature (RT). 75°C, 85'C, or 100'C (boil), as indicated on the 
lanes, and then subject to electrophoresis as described in Example 2. Proteins 
were detected by silver-staining. Protein molecular weight markers were run in lane 



m 



Figure 8. PCR enhancing activity of SDS-PAGE gel-purified samples from 
an S200-purified P. furiosus PEF preparation (prep 1). Purified PEF (=4.5 pg) was 
pre-incubated in 2% SDS for 3 minutes at room temperature (RT) or at 85°C (85°C) 
before electrophoresis. Proteins were eluted from SDS-PAGE gel slices 1-13 (gel 
RT) and 14-26 (gel 85'C) as described in Example 2. One (1) pi of each gel slice, 
diluted 1 :100 in cloned Pfu PCR buffer, was added to cloned Pfu PCRs as 
described in Example 1 (6.2kb primer-template system). The approximate 
molecular weights of the SDS-PAGE gel-purified proteins is indicated at the top of 
the gel. DNA size mart<ers were run in lanes "m". 
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Figure 9. S200.purified PEF (1 .4 pg total protein) was heated at 85 or 
100«C prior to electrophoresis. Four slices were excised from the gel between the 
60kD (slice 1 ) and 42kD (slice 4) markers. The proteins were eluted in SOpI of 
buffer and lOpI aliquots were boiled in the presence of SDS-BME loading dye and 
run out on 4-20% PAGE gels. Lane1: Novex pre-stained markers: lane 2: 0.28 nj 
PEF; lanes 3-6: gel purified proteins isolated from S200-purified PEF heated at 
85'C- slice 1 (lane 3). slice 2 (lane 4). slice 3 (lane 5). and slice 4 (lane 6): lanes 7- 
10: gel purified proteins isolated from S200-purified PEF heated at 100°C: slice 1 
(lane 7), slice 2 (lane 8), slice 3 (lane 9), and slice 4 (lane 10), 

Figure 10. SDS-PAGE analysis of P. /uriosus PEF preparations. In the left 
panel, the following samples were subject to SDS-PAGE silver stain analysis, 
conducted as described: cloned Pfu DNA polymerase lot#24A (56 ng/pl) [lanes 1-3. 
iMl. 2mI. 4mI]; S200 purified P. furiosus PEF fraction #46 (prep. 1 from SCS #38) 
[lane 4- 2mI]; S200 purified P. furiosus PEF pool fractions #47-48 (550 ng/pl; prep. 3 
from SCS #38)[lanes 5-8. .Ipl, .2mI, .4mI, Ipl]. In the right panel, the following 
samples were run: cloned Pfu DNA polymerase lot#24A (56 ng/yl) [lanes 1-3, 
2mI. 4mO: microcon 30-concentrated SCS #36 heparin sepharose fraction #78 (prep. 
2) [lanes 4-6, Ipl. 2mI. 0.4|jO; SCS #37 heparin sepharose fraction #75 (prep. 4) 
[lanes 7-8. 1 Ml. Spl). 

Figure 11. PDVF blots of P. furiosus PEF. PEF-containing heparin 
sepharose fractions (from SCS #37 Pfu purification) were concentrated and aliquots 
electrophoresed in 8 or 9 lanes on 4-20% SDS-PAGE gels as described. The 
samples were boiled for 3 min. prior to loading to recover the 50kD monomeric PEF 
(top) or were loaded in the absence of heat treatment to recover the >250kD 
aggregate (bottom). The proteins were transferred to PDVF filters (BioRad) and 
stained with Amido black. 

Figure 12. PVDF blot of S200-purified P. furiosus PEF. Approximately 
20Mg of total protein was electrophoresed in each of 7 lanes on a 12% PAGE gel. 
The samples were heated at 85 'C for 5 minutes prior to loading. The proteins were 
transfen^d to PVDF filters and stained as in the Figure 1 1 legend. 

Figure 13. Absorbance spectrum of S-200 purified P. furiosus PEF. The 
spectra of the following were obtained on a Shimadzu UV160U spectrophotometer: 
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0.7 mg/ml P. furiosus PEF. 9.6mM and 4.8mM riboflavin, and PEF final storage 
buffer (blank). 

Figure 14. Recombinant P45 (rP45) amplification enhancement of 5.2 kb 
fragment. In the 5.2 kb "On/OfT assay with PEF and rP45 samples. 7. 0.7 or 0.2 nc 
of the native PEF protein or 5mL 1 or 1 mI of a 1/10 dilution of the rP45 protein 
(20 ng/ul) was added to the on/off assay. 

Figure 15. SDS-PAGE analysis of fully denatured native PEF and rP45. 
The following protein samples were electrophoresed on 4-20% SDS-PAGE gels as 
described in the text: 100 ng native PEF. boiled in 2%SDS without (lane 1) or with 
1%TCA (lane 2); 200ng recombinant P45 clone 1, boiled in 2%SDS (lane 3) or 
loaded in the absence of heat-treatment (lane 5); 200ng recombinant P45 clone 23, 
boiled in 2%SDS (lane 4) or loaded in the absence of heat-treatment (lane 6). The 
migration of molecular weight markers is shown in lane 7 (far right). The gel was 
stained with SYPRO orange dye (Molecular Probes), diluted 1:5000 in 5% acetic 
acid. 

Figure 16. Reverse Phase HPLC Analysis of dUTP Conversion by PEF 
and rP45. Panel A. dUTP heated at 72X in 1x cloned Pfu polymerase buffer 
(negative control). Panel B. dUTP heated in 1x cloned Pfu polymerase buffer with 
700 ng native PEF. Panel C, dUTP heated in 1x cloned Pfu polymerase buffer with 
5 Ml of rP45 (clone 1). Panel D, dUMP standard heated in 1x cloned Pfu buffer, 
Panel E, the products shown in panel B and panel D were mixed and injected 
simultaneously to demonstrate, through the production of a single peak, that dUMP 
migrates identically to the PEF byproduct. 

Figure 17. Reverse Phase HPLC Analysis of dCTP Conversion by PEF and 
rP45. Panel A, dCTP heated in 1x cloned Pfu polymerase buffer without additive 
(negative control). Panel B, dCTP heated in 1x cloned Pfu polymerase buffer with 
700 ng of PEF. Panel C. dCTP heated in 1x cloned Pfu polymerase buffer with 5 pi 
of rP45. Panel D. dUTP and dCTP heated in 1x cloned Pfu polymerase buffer with 
700 ng of PEF. Chromatograms in panels A, B and D, were performed in the same 
series, while chromatogram C was performed in a different experiment, using a 
different C-18 column. In the second experiment a dCTP standard was shown to 
elute at 9.6 minutes (data not shown). 
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Flgure 18. Reverse phase HPLC analysis of the products generated by 
heat-treatment of dCTP. dCTP (10mM in cloned Pfu DNA polymerase PCR buffer) 
was heated at 95^C in thin-walied tubes in a RoboCycler 40 temperature gradient 
block. Incubations were earned out for the following length of time: 0 (panel A), 1 
hour (panel B). or 4 hours (panel C). lOpI of the reaction mixtures were loaded 
onto a Waters Delta-pak 0-18 column (300 angstrom/1 Spm). equilibrated in 50mM 
triethylammonium acetate, pH{7.0) (A). Products were eluted with acetonitrile (B) at 
2 ml/min.. using the following gradient: 0% B for 5 minutes, 0-10% B over 20 
minutes. Absorbence of the eluate was monitored with a photodiode amy detector, 
and peak areas were integrated at 260nm, To the right of panels A and C are 
shown the same chromatograms re-integrated at a higher sensitivity. In panel D. 
lOpI of lOmM dUTP (in cloned Pfu DNA polymerase PCR buffer) was 
chromatographed. 

Figure 19. Comparison of the absorbence spectrum of dCTP, dUTP, and 
the product generated from heating dCTP. The absorbence spectra (200-300nm) 
of the following were superimposed: (1 ) peak 9.283 from chromatogram A in Figure 
18 (dCTP unheated); (2) peak 12.383 from chromatogram C in Figure 18 (heated 
dCTP product); (3) peak 12.100 from chromatogram D in Figure 18 (dUTP 
unheated) The three spectra were normalized such that heights of the maximally 
absorbing peaks (Amax) are equivalent. 

Figure 20. Reverse phase HPLC analysis of the products generated by 
PCR cycling dCTP. dCTP (lOmM in cloned Pfu DNA polymerase PCR buffer) was 
cycled alone (Panel A) or in the presence of 0.5 ng/pl PEF (Panel B) or 1,25U/mI 
Pfu DNA polymerase (Panel C). Cycling was earned out in a RoboCycler 40 using 
the following conditions: 95'C 1 min./30 cycles of: 95"C 1 min.,60°C 1 min..72'C 
10:24 min./ 72^ 10 min. lOpI of the reaction mixtures were chromatographed as 
described in the Figure 18 legend. In panel D, the absorbence spectra of dCTP, 
dUTP. and the product generated from PCR cycling dCTP are compared. The 
absorbence spectra (200-300nm) of the following were superimposed: (1) major 
peak at 8.3-8.8 min. from chromatogram of dCTP + Pfu unheated; (2) peak at 
1 1.867 min. from chromatogram of dUTP in buffer unheated; (3) peak 12.150 from 
chromatogram C (Figure 20) showing dCTP + Pfu cycled. 
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Flgure 21. dUTP Inhibition and Reversible Inhibition with PEF and rP45. 
The following components were added to cloned Pfu DNA Polymerase PCR 
reactions (5.2kb "on/or assay of example 1): Lane 1, 2 pM dUTP + 7 ng native 
PEF. Lane 2, 2MMdUTP. Lane 3, 2 pM PEF generated dUMP. Lane 4. 20 pM 
PEF generated dUMP. Lane 5. 2 pM dUTP + 180 ng rP45 done 1. Lane 6. 2 pM 
dUTP + 20 ng rP45 done 1. Lane 7, 2pM dUTP + 2 ng rP45 done 1. Lane 8. 2 pM 
dUTP + 180 ng rP45 done 23. Lane 9. 2 pM dUTP + 20 ng rP45 clone 23. Lane 
10. 0.2pM dUTP. Lane 11.0.2pMdUTP + 7ng native PEF. Lane 12, 0.2pM dUTP 
+ 20 ng rP45 done 1. Lane 13, 0.2pM dUTP + 20 ng rP45 done 23. In the right 
Lane M, 1 Kb DNA maricers were run. 

Figure 22. Inhibition of PCR with unrelated dU Containing Oligonudeotides. 
A 0.9kb target was amplified from human genomic DNA. Lanes 2 and 3 represent 
the PCR amplification without the addition of a third oligo. The sample in lane 3 
was amplified in the presence of 7ng of PEF. Lanes 4-8 and 10-11 represent the 
titration of the control (dT rather than dU) oligonucleotide (40mer). Lanes 13 
through 32 represent samples amplified in the presence of the dU oligonudeotide 
{40mer). Lanes 13-19 were amplified with Pfu DNA polymerase alone. Lanes 14 
through 25 were amplified with Pfu DNA polymerase in the presence of 7 ng of 
PEF. Lanes 27 through 32 were amplified with Taq DNA polymerase. 

Figure 23. Human dUTPase mimics PEF/P45 activity. PEF activity was 
measured using the 5.2kb primer-template assay described in example 1. At each 
annealing step of the PCR reaction. O.Spl of the following were added: dUTPase 
storage buffer (negative control), recombinant P45 (at 2ng/pl)(positive control), 
human dUTPase preparation undiluted (duplicate reactions), or diluted 1:10 
(duplicate reactions). 

Figure 24. Presence of PEF(P300) in P. furiosus DNA polymerase 
preparations. SDS-PAGE analysis is shown for six preparations of native Pfu DNA 
polymerase with varying levels of PEF present. The volumes of each lot loaded is 
indicated (pi). The protein samples were not boiled prior to electrophoresis and the 
gels were silver-stained.. 

Figure 25. Western blot analysis of crude extracts using P. furiosus PEF- 
spedfic IgG. Extracts were prepared from 5 different fermentations of P. furiosus 
(lanes 1-5). 3 partially purified fractions from T. aquaticus (lanes 6-8), and 1 extract 
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from £ CO// (lane 9). Purified PEF (550 ng) and pre-stained molecular weight 
markers were run in lanes 10 and 11. respectively. With the exception of the 
markers, all samples were boiled in SDS/BME dye prior to loading. 

Figure 26. Western blot analysis of native PEF samples using anti- 
recombinant P45 antibody. Three samples (700 ng each) of native Pfu PEF were 
prepared as follows: unheated/2% SDS loading dye (lane 1); heated/2% SDS 
loading dye (lane 2); and heated/1 %TCA/2% SDS loading dye (lane 3). Heating 
was carried out for 5 minutes at 95°C. The samples were electrophoresed on a 4- 
20% gradient gel and transferred to nitrocellulose. The blot was probed as 
described in the text with rabbit anti-rP45 sera. Novex prestained molecular 
mari<ers (SeeBlue™) were run adjacent to the PEF samples in lane 4. 

Figure 27. Western blot using the antibody to CBP-rP45 and cell lysates 
from different species. The lysates shown in lanes 2-7 were prepared by 
sonication. The samples were heated at SS'C prior to loading except for the sample 
loaded in lane 10 which was left at room temperature. Lane 1. SeeBlue™ Mariners 
(Novex), Lane 2. Uncharacterized Bacillus species (Magenta), lane 3, Thermus 
aquaticus. lane 4. uncharacterized species, Lane 5. Pymcoccus furiosus, Lane 6, 
Methanobacterium thermoautotrophicum, lane 7. Human lymphoma HL60 cells. 
Lane 8 purified Pymcoccus furious PEF. Lane 9 purified fusion CBP-rP45. Lane 10 
purified fusion CBP-rP45. 

Figures 28, 29, and 30. PGR enhancing activity of P. furiosus PEF in 
cloned Pfu DNA polymerase PCRs. PGR amplifications were performed as 
described in example 14 with the following additional notes. Figure 28. A 1.9 kb 
/acl-/acZa target was amplified from plasmid DNA. lOOAil PGRs were conducted 
with 50 pg of pPRIAZ. 100 ng of primers (5'GAT AGC GAA TTC GCA AAA CGT 
TTC GCG GTA TGG 3' (SEQ. ID NO: 20); 5'ACT ACQ GAA TTC GAG GGA AAA 
TGC CGC TCA TCG 3* (SEQ ID NO: 21)). and 5U cloned Pfu DNA polymerase in 
the absence (duplicate samples #18) or the presence (duplicate lanes #19) of O.Spl 
of a PEF-containing heparin sepharose fraction (H.S. #75; prep. 4; » 10 ng/pl PEF). 
PGR cycling was conducted on a GeneAmp PGR System 9600 (Perkin Elmer 
Getus) using the following conditions: 30s at 95''G (1 cycle)/ 5s at SS'C; 1 min. at 
55"G; 2.5 min. at 72»G (30 cycles). Figure 29. A lOkb target from lambda DNA 
was amplified. lOOpI PGRs were conducted with 250 ng of lambda DNA (Sigma), 
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250 ng of primers (F51.20 S-GGC-GTr-TCC-CTT-CTT-CTT-CG 3' (SEQ ID NO: 
22) R10163.20 S'CCA-TCT-CAC-GCG-CCA-GTT.TC 3' (SEQ ID NO: 23)). and 5U 
cloned Pfu DNA polymerase in the absence (lane 1) or the presence of Ipl of a 
S200-purified PEF (prep. 3; 550 ng/pl PEF) diluted 1:500 (lane 2). 1:50 (lane 3). or 
1:5 (lane 4). PGR cycling was conducted on a Robocyder 40 (Stratagene) using 
the following conditions: 95'C for 1 min. (1 cyciey 95'C for 1 min.; 62'C for 1 min.; 
72*C for 10 min. (30 cycles). Figure 30. Lanes 1-5. a 5.2 kb portion of the human 
a1 antitrypsin gene was amplified from genomic DNA. 25 pi PCRs were conducted 
with 62.5 ng of human genomic DNA (Promega). 50 ng of primers (F91.23 S'GAG 
GAG AGO AGG AAA GGT GGA AC (SEQ ID NO: 24); R5271-21 5'GCT GGG 
AGA AGA CTT CAC TGG) (SEQ ID NO: 25). and 0.6U cloned Pfu DNA 
polymerase in the absence (lane 1) or the presence of Ipl of S200 purified PEF 
(SCS #52: 0.7 pg/ul) diluted 1:1000 (lane 2). 1:10,000 (lane 3). 1:100.000 (lane 4) 
or 1 :1 .000.000 (lane 5). PGR cycling was conducted on a RoboCycler 96 
(Stratagene) using the following conditions: 96''C for 45s (1 cycle)/ 96*C for 45s; 
60'C for 45s; 72''C for 14 min. (35 cycles)/72'C for 10 min. (1 cycle). 

Figure 31 . Effect of Stratagene's Perfect Match (£ coli ssb) on the 
specificity of PCRs conducted with cloned Pfu DNA polymerase and P. furiosus 
PEF. PCRs were conducted using the 6.2kb test system (example 1) in IOOmI 
reaction volumes. Ipl of the following were added to PCRs: lanes 1 ,5-dilution 
buffer; lanes 2.6- undiluted PEF-containing heparin sepharose fraction (microcon 
30-concentrated SCS #36 H.S. #78. prep. 2; »40 ng/pl PEF); lanes 3.7- PEF 
fraction diluted 1:10; lanes 4.8- PEF fraction diluted 1:100. lyl of Perfect Match was 
added to PCRs run in lanes 5-8. No DNA samples were loaded in the lanes 
between lanes 3 and 4. 4 and 5, and 7 and 8. 

Figure 32. SDS-PAGE analysis of heparin sepharose (H.S.) column 
fractions. The heparin sepharose fractions indicated (# at top) from SCS native Pfu 
DNA polymerase prep. #37 (SCS #37) were analyzed on 4-20% SDS-PAGE gels 
(4Ml/lane). Prestained molecular weight mariners were run in lanes denoted "m" and 
4mI of SCS #36 H.S. #78 (PEF prep. 2) was run in lanes mari«ed "A". The peak of 
DNA polymerase (95kD) activity eluted between fractions 50 and 61. 

Figures 33, 34, 35, and 36. Enhancing activity of P. furiosus PEF in PCRs 
conducted with Pwo (Figure 33). JDF-3 (Figures 34 and 35). ES4 and Vent (Figure 
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35). and Deep Vent (Figure 36) DNA polymerases. In Figure 33. 25pl PCRs were 
conducted using the 6.2kb test system (example 1) with 25U/ml Pwo DNA 
polymerase and 1 x cloned Pfu PCR buffer. Ipl of S200-purified P. fuhosus PEF 
(prep 1; 225 ng/pl) was added undiluted (lane 1) or diluted 1:10 (lane 2). 1:1000 
(lane 3). 1:10.000 (lane 4). ijjj of dilution buffer was added as a negative control 
(lane 5). In Figure 34. a 10 Kb lambda DNA target was amplified from lambda DNA 
(lanes 1-4; 5-7; 11-13) or mouse genomic DNA. containing 40 copies (lanes 8-10) 
or 1 copy (lanes 14-16) of a lambda DNA transgene. 100^/1 PCRs were conducted 
with 250 ng of lambda DNA (Sigma) or mouse genomic DNA. and 250 ng of primers 
(F51-20 5'GGC-GTT-TCC-GTT-CTT-CTT-CG (SEQ ID NO: 22); R10163-20 
5'CCA-TCT-CAC-GCG-CCA-GTT-TC)(SEQIDN0: 23). PCRs were conducted in 
Taq PCR buffer using 5U Taq DNA polymerase (lanes 1-4) or 1U JDF3 DNA 
polymerase (lanes 5-16). 1 \i\ of the following was added to PCRs: S200-purified 
PEF (prep. 3; 550 ng/pl PEF) diluted 1:500 (lanes 2. 12. 15). 1:100 (lanes 6.9). 1:50 
(lane 3). 1:10 (lanes 7.10.13.16). or 1:5 (lane 4). ipl of dilution buffer was added as 
a negative control (lanes 1.5.8.11.14). PCR cycling was conducted on a 
Robocycler40 (Stratagene) using the following conditions: 95°C for 1 min. (1 cycle)/ 
95»C for 1 min.; 62"'C for 1 min.; 72^ for 10 min. for Taq or 5 min. for JDF3 (30 
cycles). In Figure 35, the 1.9kb ligase gene was amplified from P. fuhosus 
genomic DNA. IOOjjI PCRs were conducted with 250 ng of DNA and 250 ng of 
primers (5'GAG CTT GCT CAA CTT TATC (SEQ ID NO: 26); 5'GAT AGA GAT 
AGT TTC TGG AGA CG) (SEQ ID NO: 27). PCRs were conducted with 10U ES4 
DNA polymerase in Pfu PCR buffer (lanes 1.2). 1.5U JDF3 DNA polymerase in Taq 
PCR buffer (lanes 3.4). 4U Pfu DNA polymerase in cloned Pfu PCR buffer (lanes 
5.6). 1U Vent DNA polymerase in Vent PCR buffer (lanes 7.8). or 1U Taq DNA 
polymerase in Taq PCR buffer (lanes 9.10). Ipl of the following was added to 
PCRs: dilution buffer (lanes 1.3.5,7.9) or S200-purified PEF (prep. 3; 550 ng/pl 
PEF) diluted 1:100 (lanes 2.4.6.8,10). PCR cycling was conducted on a DNA 
Thermal Cycler 480 (Perkin Elmer Cetus) using the following conditions: 95'C for 1 
min.; 46X for 1 min.; 72'C for 2 min. (30 cycles). In Figure 36, a 2kb DNA target 
was amplified from transgenic mouse genomic DNA. lOOpI PCRs were conducted 
with 250 ng of DNA and 250 ng of primers (F51-20 5'GGC GTT TCC GTT CTT 
CTT CG (SEQ ID NO: 22); R2092-23 5'CGG GAT ATC GAC ATT TCT GCA CC) 
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(SEQIDNO: 28). PCRs were conducted with 0.75U Deep Vent DNA polymerase 
in Deep Vent PGR buffer (lanes 1-4). Ipl of the following was added: dilution buffer 
(lane 1) or S200-purified PEF (prep. 3; 550 ng/pl PEF). diluted 1:500 (lane 2). 1:100 
(lane 3). 1 :50 (lane 4). PGR cycling was conducted on a Robocycler40 
(Stratagene) using the following conditions: 95°G for 1 min.(1 cycle)/95»C for 1 
min.; 62'C for 1 min.; 72^ for 2 min. (30 cycles). 

Figure 37. Enhancement of RT-PGR with P. furiosus PEF. A portion of the 
EGF receptor sequence was amplified from HepGj total RNA using the 2-step RT- 
PGR protocol described with the following additions. 1 pi of a cDNA synthesis 
reaction was added to 50pl PGR reactions containing 5U Pfu DNA polymerase, 
1pM antisense primer (5' GAG-TTA-AAT-GCC-TAC-ACT-GTA-TCT) (SEQ ID NO: 

29) . IpM sense primer [5- CAG-GAC-TGA-GAA-GGT-GGT-ATC-GAA (SEQ ID NO: 

30) (Ikb product) or 5'CTG-GAG-GTG-GCG-TGT-AGG-ATT-TGT (SEQ ID NO: 31) 
(3kb product)]. 1 pi of buffer (0) or VI of a PEF-containing heparin sepharose 
fraction (SCS #37 H.S. #75; prep. 4; =10 ng/pl PEF) was added undiluted (UD) or 
diluted 1:10 to 1:10* (as indicated). PCRs were conducted in cloned Pfu PGR 
buffer, containing 3mM MgSO,. 

Figure 38. Enhancement of Pfu DNA polymerase-based Seamless Cloning 
with P. furiosus PEF. 20 ng of plasmid was amplified as described in the Seamless 
Cloning kit protocol in the absence (lanes 2,4.6,8.10,12) or presence (duplicate 
lanes 1.3.5.7.9,11) of S200-purified P furiosus PEF (prep. 3; 550 ng/pl), diluted 
1:100. A 7.2 kb target was amplified with 6 different primer pairs (each set of 3 
lanes). An extension time of 7.5 minutes was used for PEF-containing reactions, 
while an extension time of 15 minutes was used for reactions lacking PEF. 

Figure 39. Enhancement of Pfu DNA polymerase-based QuikChange 
mutagenesis with P. furiosus PEF. QuikChange mutagenesis was performed using 
the kit control primers and plasmid template, with either native or cloned Pfu DNA 
polymerase. To the reactions was added 1pl of dilution buffer or 1 pi of a PEF- 
containing heparin sepharose fraction (SCS #36 H.S. #78; prep. 2 = 40 ng/pl). 
diluted as indicated. The relative amount of linear amplification product was 
assessed by the intensity of ethidium bromide-stained product bands on agarose 
gels. Supercompetent £. coli cells were transformed with the digested amplification 
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products. The number of amp resistant colonies and the mutation frequencies were 
scored. 

Figures 40 and 41. PGR enhancing activity of S200-purified P. furiosus 
PEF. PGR enhancing activity was measured in duplicate assays using the 6.2 kb 
test system described in Example 1. In Figure 40. purified PEF (225 ng/pl; prep. 1) 
was diluted in 1 x cloned Pfu PGR buffer and 1 pi aliquots of the following were 
added to 24MI PCRs- lane 1 . buffer; lane 2. PEF undiluted: lanes 3-6. PEF diluted 
1:10. 1:100. 1:1000. 1: 10.000. respectively. In Figure 41. the following amounts of 
purified PEF were added: lane 1- ipl of buffer, lane 2- Ipl PEF neat, lanes 3-8. Ipi 
PEF diluted 1:10. 1:100. 1:1000. 1:10.000. 1:100.000. and 1:1.000.000. 

Figures 42 and 43. PGR enhancing activity of heparin sepharose-purified 
P. furiosus PEF. PGR enhancing activity was measured using the 6.2 kb test 
system described. The PEF fraction (=40 ng/pl; prep. 2 in text) was diluted in 1 x 
cloned Pfu PGR buffer. In Figure 42. Ipl aliquots of the following were added to 
SOpI PGRs- lane 1. buffer; lanes 2-5. PEF diluted 1:10, 1:100. 1:1000. 1; 10.000. 
respectively. In Figure 43. Ipl aliquots of the following were added to 24mI PGRs- 
lane 1. buffer; lane 2. PEF undiluted; lanes 3-6. PEF diluted 1:10. 1:100. 1:1000. 1: 
10.000. respectively. DNA mariners were electrophoresed in lane "m". 

Figure 44. PGR enhancing activity of heparin sepharose column fractions. 
The PGR enhancing activity contained in 1 pi of each column fraction (fractions 1 -93 
from Figure 32; numbered at top) was measured using the assay 6.2kb primer- 
template PGR described in Example 1. Fractions 50-61 contained the peak of Pfu 
DNA polymerase activity and were excluded from analysis. 

Detailed PescrlDtlon of Snecifie Fmhn,^ iments nf thp »n»/..nt ln 
The following description should not be construed to limit the scope of this 
invention to any specifically described embodiment. Various aspects and 
embodiments of this invention will be apparent from the disclosure as a whole in 
context with the knowledge of one skilled in the art. In addition, the description 
herein, in combination with infonnation known or available to persons of ordinary 
skill in the art. enables the practice of the subject matter encompassed by the 
following claims. 

For the purposes of this invention, a nucleic acid replication reaction can 
mean any of the numerous nucleic add amplification, primer extension, reverse 
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transcription, or other nucleic acid polymerization reactions known in the art. 
Additionally, a replication reaction of this invention includes any reaction in which 
the ability of an enzyme to interact with a first nucleic acid and generate a second, 
substantially complementary nucleic acid sequence, is involved. The amplification 
reactions of this invention are not limited to PCR processes or any particular PCR- 
based assay, although they are particulariy useful herein, and specifically include 
RT-PCR processes. The proteins, preparations, compositions, mixtures, kits and 
methods of this invention can be used with any appropriately designed nucleic acid 
replication reaction. 

As used herein, the temi "PEP refers to a naturally occurring protein 
derived from a bacterial, eukaryotic. or archael source (or a wholly or partially 
synthetic copy or analog thereof) having polymerase enhancing activity, mixtures of 
one or more such proteins, protein complexes containing one or more such 
proteins, or extracts containing one or more of such proteins, mixtures or 
complexes. Recombinant PEF proteins, as a wholly synthetic copy of a naturally 
occurring protein, and complexes with at least one recombinant PEF protein are 
also "PEFs" according to this invention. The Pfu P45 and P50 proteins of this 
invention are illustrative of PEF proteins, which exhibit an apparent molecular 
weight of approximately 45kD and 50kD and are predominant components of a PEF 
complex derivable from Pfu. Data relating to both the P45 and P50 proteins is 
presented herein and details specific stmctural information. On SDS-PAGE. the 
non-heat-treated PEF complex containing P45 and P50 as well as minor additional 
components migrates with an apparent molecular weight > 250kD. One species of 
the PEF complexes of this invention is the P300 complex from Pyrococcus furiosus. 

The present invention is intended, however, to encompass other PEF 
proteins, mixtures, complexes, compositions, and extracts derived from organisms 
other than Pfu identified by techniques analogous to those provided by the following 
examples, or by use of the structural information on the PEF proteins described 
herein or derivable from the proteins described herein. More specifically, the 
invention is intended to encompass PEFs identified on the basis of sequence 
homology to all or pari of the PEFs described herein, including nucleic acid 
homology to all or part of the DNA sequence encoding the proteins described 
herein or the DNA sequences described herein. Computer-implemented homology 
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searches using the sequence information herein, stored on an appropriate 
computer-readable medium, with procedures known in the art. can generate these 
homologous PEFs. Also, reactivity with antibodies to the proteins, complexes, or 
extracts disclosed herein can be used with procedures known in the art to generate 
homologous PEFs. 

One skilled in the art is familiar with methods of generating analogs of 
proteins. Various techniques from publications in the art can be used to mutate, 
modify, tmncate. or otherwise change a protein's amino acid sequence and retain 
functional activity. In the case of a dUTPase activity as PEF. the known crystal 
structure of dUTPases. such as £ cotf dUTPase. provides specific infomiation on 
regions of a dUTPase that may be mutated in any of a number of ways while still 
retaining PEF activity (see Cedergren-Zeppezauer. E, S.. ef a/.. Nature 355: 
740(1992). which also notes similarities to mammalian dUTPases in Hokari. S.. ef 
aA, Arch. Biochem. Biophys. 253: 350 (1987)). Similarly, wholly or partially 
synthetic or recombinantly expressed proteins can also be generated from the 
information herein by those skilled in the art (For example. Ausubel et at. (1989) 
Current Protocols in Molecular Biology, and supplements through February 1997.) 

Furthemiore. a PEF can also be a protein exhibiting a dUTPase activity, as 
demonstrated herein. Specifically, human. Pyrococcus furiosus, and Therwus 
thermophilis dUTPase activities can be used to enhance polymerization reactions 
as a PEF. Other eukaryotic. bacterial, and archael sources can similarly be used to 
derive a PEF as a dUTPase protein. A PEF can also be any protein that functions 
to turn-over dUTP or lower the concentration of dUTP. Similarty. the invention 
comprises a method of enhancing replication or amplification reactions by reducing 
the dUTP concentration or preventing the incorporation of dUTP into replicated or 
amplified products, as well as compositions that are capable of preventing that 
incorporation. 

The polymerase enhancing activity of the PEFs of this invention can be 
detennined in a number of different ways. The description below details a few 
examples of assays and techniques one skilled in the art can use to detemiine if 
PEF activity is present. These assays and techniques can be used alone or in 
combination. 
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Example 1 specifically details screening assays and the "on/off' assay. This 
type of PGR assay allows one to identify the presence of a polymerase enhancing 
activity in a sample, More generally, any assay that shows an increase in PGR 
product yield, over a negative control level, when a sample suspected to contain a 
5 polymerase enhancing activity is added can be used to identify a polymerase 

enhancing activity. Also, any assay that shows an increase in processivity. over a 
control level, reflected by the increased length of PGR products being generated 
when a sample suspected to contain a polymerase enhancing activity is used. A 
combination of PGR product yield and increased processivity can also be used to 

10 determine whether or not a polymerase enhancing activity is present. 

A polymerase enhancing activity can also be identified by assays that 
indicate a reduction in the PGR inhibitory action of incorporated dUTP. For 
example. PGR reactions can be conducted in the presence of dUTP and samples 
suspected of containing polymerase enhancing activity. Those reactions that allow 

1 5 polymerization in the presence of dUTP indicate a polymerase enhancing activity in 

the form of a dUTPase activity. Thus, a dUTPase activity can be a polymerase 
enhancing activity. 

Also, a composition that functions to turn-over dUTP. especially under 
thermophilic reaction or PGR reaction conditions, can be a polymerase enhancing 

20 activity as a dUTPase. An enzyme or activity that acts on dUTP so that it is not 

incorporated into a newly polymerized strand functions to turn-over dUTP. The 
tum-over of dUTP can also be detected by an assay for the conversion of dUTP into 
dUMP, as detected by analyzing the reaction products by HPLG. for example. 
Biochemical assays that detect the conversion of dUTP into dUMP, or other 

25 nucleoside phosphate or metabolic derivatives or products, can be devised or are 

known in the art and can be used to identify polymerase enhancing activity as a 
dUTPase activity. 

A polymerase enhancing activity can also be a dUTPase enzyme that 
possesses the consensus uridine-binding sequence motif (SEQ ID NO.: 72). A 

30 number of those enzymes are identified below. However, numerous others exist or 

can be identified ttirough computer-implemented or other sequence analysis 
procedures known in th art. Thus, the presence of the consensus uridine-binding 
motif or the related sequences shown herein can also be used to define an enzyme 
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or protein that is a PEF, such as a protein that comprises SEQ ID NO.: 72, or any 
one of SEQ ID NOs.: 72-81. or combinations of these sequences. 

Also, proteins identified through sequence identity comparisons known in the 
art can be used to confirm the presence of a PEF. For example, proteins from one 
species possessing a sequence identity of approximately 18% or greater have been 
shown in the art to be related to or analogous to the known protein of another 
species. In the examples below, a sequence similarity of approximately 39% 
suffices to positively identify a dUTPase activity that can act as a PEF. 

The antibodies to PEF described herein can also identify a protein with 
polymerase enhancing activity. For example. Western blot analysis of composiUons 
from various archeal, bacterial, thermophilic bacterial, or eukaryotic samples can 
identify a protein that possess polymerase enhancing activity. Furthermore, as the 
PEF proteins and complexes of this invention are demonstrated as immunogenic, 
various other antibodies to PEF may be produced by techniques known in the art 
with the information herein. These other antibodies can also be used to identify a 
PEF. 

Example 1 
Screening for PEF Activity 

Protein-containing extracts from a number of different sources can be tested 
for PEF activity. The extracts can be prepared in a number of ways known in the 
art. 

One method was demonstrated with Pfu DSM 3638 cells. The cells were 
grown, a cell paste collected by centrifugation and then frozen at -80'C. The paste 
was taken up with lysis buffer [50 mM Tris-HCI (pH 8.2). 1 mM EDTA. 10 mM B- 
mercaptoethanol. 0.5mM PMSF. and 2\iglm\ aprotinin], and thereafter the cells 
were lysed in a French press and then sonicated. Following sonication. the lysate 
was centrifuged and the supematant, containing potential PEFs, was collected for 
assays. 

Extracts from any cell producing a PEF, for example, cells transfected with a 
recombinant vector directing the expression of a PEF, can also be assayed. 
Methods of making extracts of these cells are known in the art and are exemplified 
below. 
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1. Screening Assays f r PCR-Enhancing Activity 
One method of detecting thermostable PEFs is by screening partially- 
purified fractions from themiophilic archeal or bacterial extracts for PGR enhancing 
activity. PGR enhancing activity can be detected in samples consisting of column- 
purified fractions as well as homogeneous protein samples and proteins recovered 
by elution from SDS-PAGE gel slices (see below). Samples are added to PGR 
amplification reactions containing DNA polymerase, buffer. dNTPs, primers, and 
DNA template. PGR enhancing activity is identified by an increase in PGR product 
yield for amplifications conducted in the presence of a particular sample (DNA 
polymerase + PEF) as compared to amplifications conducted in the absence of 
added sample (DNA polymerase only). 

When screening samples suspected of containing endogenous DNA 
polymerase activity, for example protein extracts, negative controls can be 
perfonned in which the exogenous DNA polymerase has been omitted from the 
PGR amplifications. In addition, when screening samples contaminated with DNA. 
negative controls can be earned out in which exogenous DNA template is omitted 
from the PGR amplifications. 

The sensitivity of the PGR enhancing assay is dependent on the complexity 
of the DNA targets employed. PGR reaction parameters (target complexity, DNA 
template concentration, polymerase concentration. PGR cycle number or extension 
time) can be adjusted so that the yield of PGR product is barely detectable under 
normal conditions. In addition, samples for testing can be diluted appropriately so 
that the concentration of PEFs falls within the detectable range of the PGR 
enhancing activity assay. 

2. Screening Assays for PCR-Enhancing Activity Using Pfu-derived 
Samples 

A number of amplification assays can be designed to detect the presence or 
absence of PEF activity, and/or compare PEF activity between samples. Generally, 
these tests employ a sample containing a rare sequence to be amplified. The 
sequence is so rare, or the conditions so designed, that amplification under nomial 
situations results in barely detectable or no detectable amplified product. By adding 
a sample with putative PEF activity, any effects on the amount of amplified product 
fonmed can be detected. 
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One particular screening assay is called the "On/OfT assay, which detects 
the presence or absence of PEF. The "On/OfT assay results in appreciable 
amplified product only when PEF activity is present, or a detectable difference in 
amplified product when PEF activity is present compared to when PEF is not 
present Methods for detecting the amount of amplified product are known in the 
art and include those using electrophoresis and hybridization. 

One embodiment of an assay used to screen for PEFs, in this case from 
Pymcoccus furiosus, involved amplifying a 6.2 kb target from transgenic mouse 
DNA (6.2 kb primer-template assay). A master PGR cocktail was prepared 
consisting of: 200mM each dNTP. 2 pg/ml primer F432-21 (S'-CTA-TTG-AGT-ACG- 
AAC-GCC-ATC)(SEQ ID NO.: 62). 2 pg/mi primer R6656-20 (GTC-ACG-CTT-GCT- 
CCA-CTC-CG){SEQ ID NO.: 63). 2.5 pg/ml AAA742 DNA (transgenic mouse DNA 
having 40 copies of a lambda phage DNA shuttle vector). 1x cloned Pfu DNA 
polymerase PGR buffer (10mM KCI. lOmM (NHJ^SO,, 20mM Tris-HCI (pH 8.8). 
2mM MgSO,. 0.1%(v/v) Triton X-100. and lOOpg/ml BSA). and 25U/ml cloned Pfu 
DNA polymerase. Samples were diluted in 1x cloned Pfu DNA polymerase PGR 
buffer, and Ipl of each diluted sample was added to 24|jl of the PGR cocktail. PGR 
amplifications were conducted on the RoboGycler 96 Temperature Gycler 
(Stratagene), using the following conditions: 96'C for 45s (1 cycle)/ dB'C for 45s; 
60X for 45s; 72'C for 14 min. (35 cycles)/72°G for 10 min. (1 cycle). PGR 
products (10 pl/lane) are run out on 1% agarose gels and PGR product bands 
visualized by ethidium bromide staining. Samples with PGR enhancing activity 
exhibit higher yields of the expected 6.2 kb PGR product than can be obtained in 
the presence of DNA polymerase alone. When PGR enhancement is due to the 
presence of PEFs. rather than contaminating endogenous DNA polymerase activity, 
amplifications performed in the absence of exogenous DNA polymerase (e.g. Pfu 
DNA polymerase) yield no PGR product. Moreover, PGR amplifications performed 
in the absence of exogenous DNA template should yield no PGR product when the 
PEF sample lacks contaminating target DNA. 

A second embodiment of an assay to screen for PEF employs, for example, 
the 5.2 kb human a 1 -antitrypsin gene in a PGR amplification. PGR amplification of 
this primer/template system was so limited that, in the absence of PEF, it was 
difficult to detect any PGR product. With added PEF activity, a 5.2 kb product was 
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easily detected. The following conditions were used for this "On/Or assay: In 100 
Ml ■ IX Cloned Pfu DNA polymerase buffer. 200 pM each dmp, 200 ng primer F- 
91-23. 200 ng primer R5271-21. 125 ng Human Genomic DNA. 2.5 units cloned Pfu 
DNA polymerase. +/- PEF or recombinant P45 (rP45). 
Primer F91-23 5' gaggagagcaggaaaggtggaac 3' (SEQIDNO:64) 
Primer 5271-21 5' GCTGGGAGAAGACTTCACTGG 3* (SEQ ID NO: 65) 

The PGR cycling conditions were as follows: 95X for 1 minute (1 cycle). 95'C for 1 
minute - 60'C for 1 minute - 72-C for 10 minutes (30 cycles). After completion, the 
reactions are run out on an electrophoresis gel and the quantity of reaction products 
determined by any of a number of methods known in the art 

3. Screening Assays for Nucleic Acid Replication Enhancing Activity 
Extracts can also be added to any nucleic acid replication reaction to 
determine PEF activity. Many of these reactions are known in the art. including 
primer extension reactions. DNA sequencing reactions, site^jirected mutagenesis 
reactions, and a number of PCR-based reactions. (Ausubel. F.M.. et al. (1989) 
Current Protocols in Molecular Biology. Greene Publishing Associates and Wiley- 
Interscience. New York. NY; Sambrook. J., et al. (1989) Molecular Cloning: A 
Laboratory Manual. 2nd Ed.. Cold Spring Harbor Laboratory Press. Cold Spring 
Harbor. N.Y.) By comparing the results produced in a nucleic acid replication 
reaction with and without the added extract, one can identify the presence of PEF. 

Example 2 
Purification of PEF from P. furiosus 
Once PCR enhancing activity has been detected from, for example, archeal 
or bacterial sources, large amounts of purified PEF can be obtained by column 
chromatography. The following protocol was developed for purifying PEF from P. 
furiosus (Pfu). However, one skilled in the art will appreciate that other cells or 
species could be used as well. 
1. Cell Growth and Lysis 

P. furiosus DSM 3638 cells were grown in a 400 liter fermentor according to 
established protocol. (U.S. Patent No. 5.545.552. specifically incorporated herein 
by reference.) The cell paste was collected using a Sharpies in-line centrifuge after 
approximately 20 hours (A600 =0.5). and then immediately frozen In liquid Nj and 
stored at -80X until use. Then. 500 grams of frozen cell paste was transferred to a 
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4 liter stainless steel beaker on ice. The cells were resuspended with 2 liters of 
lysis buffer, consisting of 50 mM Tris-HCI (pH 8.2). 1 mM EDTA, 10 mM B- 
mercaptoethanol, O.SmM PMSF. and 2^g/ml aprotinin. The cells were lysed in the 
French press using 2 passes at 8K PSI and the lysate was then sonicated for 10 
minutes. Following sonication. the lysate was transfen-ed to 400 ml bottles, spun fc 
1 hour at 9K rpm in a Sorvall RC-2B centrifuge using a Sorvall GS3 rotor, and the 
supernatant collected. 

2. Purmcation by Column Chromatography 
The supematant was loaded at a flow rate of 5ml/min. onto a 10 x 5 cm Q- 
Sepharose Fast Flow™ (Pharmacia) column (=392 mis), pre-equilibrated in buffer 
consisting of 50 mM Tris-HCI (pH 8.2), 1 mM EDTA, and 10 mM 8- 
mercaptoethanol. The column was washed with 2 column volumes of buffer, and 
the pass-through and column washes were collected and pooled. The pooled 
fractions were adjusted to pH 7.5 using IN HCL 

The Q-Sepharose pass-through was then loaded at a flow rate of 5 ml/min. 
onto a 5 X 11.5 cm (=225 mis) SP Sepharose Big Bead^ (Pharmacia) column, 
equilibrated in buffer containing 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 1 mM DTT, 
10% (v/v) glycerol. 0.1% (v/v) Igepal CA-630. and 0.1% (v/v) Tween 20. The 
column was washed with equilibration buffer until the absorbance (ODjao) 
approached baseline. The column was eluted with a 2 liter gradient from 0 to 250 
mM KCI (in equilibration buffer). Fractions of 20 ml were collected, and aliquots 
removed from every third tube for SDS-PAGE analysis. 

Some of the fractions analyzed by SDS-PAGE showed a band > 250kD 
when a sample was not heated prior to electrophoresis (=300kD). The fractions 
containing the 300 kD band were pooled and dialyzed overnight against 2x4 liters 
of Buffer A [50 mM Tris-HCI (pH 8.2), 1 mM EDTA, 1 mM DTT, 10% (v/v) glycerol. 
0.1% (v/v) Igepal CA-630, and 0.1% (v/v) Tween 20]. The dialyzed pool was loaded 
at a flow rate of 2 ml/min. onto a 2.6 x 29 cm (=154 mis) Heparin Sepharose CL- 
6B™ (Phamiacia) column, equilibrated in Buffer A. The column was washed with 1 
liter of Buffer A, and then eluted with a 1 .5 liter gradient from 0 to 300 mM 
KCI/Buffer A. Fractions of 10 ml were collected, and aliquots removed from every 
third tube for SDS-PAGE analysis. Fractions containing the 300kD band were 
pooled and dialyzed ovemight against 2x4 liters of Buffer A. 
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The heparin sepharose-purified pool was loaded at a flow rate of 0.5 ml/min. 
onto a 1.6 X 95 cm (=191 mis) Sephacryl S-200 High Resolution'" (Pharmacia) 
column equilibrated in Buffer A containing 100 mM KCI. Then. 2 ml fractions were 
collected and aliquots removed from every third tube for SDS-PAGE analysis. 
Fractions containing the 300kD band were pooled and dialyzed overnight against 1 
liter of buffer containing 50 mM Tris-HCI (pH 8.2). 0.1 mM EDTA. 1 mM DTT. 50% 
(v/v) glycerol. 0.1% (v/v) Igepai CA-630. and 0.1% (v/v) Tween 20. The purified 
protein was stored at -20'C. The purification protocol described above yielded = 1 
mg. of relatively homogeneous P300 band from 500 g. of cell paste. 
3. Purification of PEF from SDS>PAGE gels 
PEF in a heterogeneous sample can be identified by eluting purified protein 
from SDS-PAGE gel slices and rescreening for PCR enhancing activity. This 
method allovw rapid assessment of the number of PEF proteins in a particular 
sample and identification of their apparent molecular weight. 

P. furiosus protein samples with PCR enhancing activity were 
electrophoresed on 4-20% acrylamide/2.6% bis-acrylamide Tris-Glycine gels 
(Novex). along-side pre-stained molecular weight markers. Samples were loaded in 
the presence of 2% SDS. but were not boiled in order to prevent dissociation of 
PEF complexes. The gels were run in Tris-Glycine buffer containing 1% SDS, and 
after electrophoresis, were washed briefly in 20mM Tris-HCI (pH 8.0)/1mM EDTA. 
Then. 2-3mm gel slices were excised from the lanes of interest using the pre- 
stained molecular weight mariners as a guide. Each gel slice was cut-up with a 
razor blade and the pieces transferred to 50^1 of elution buffer (20mM Tris-HCI (pH 
8.0)/1mM EDTA/0.1% Tween-20). The slurry was incubated at 72°C for 30 
minutes. 

Gel slices containing PEFs are identified by testing the eluates for PCR 
enhancing activity. Eluates containing >0.1 ng//il PEF are then re-analyzed on 
silver-stained SDS-PAGE gels to verify the apparent molecular weight of the 
predominant protein component. The gel slice eluates are boiled in the presence of 
2% SDS before loading and the apparent molecular weights of PEF proteins 
detennined relative to protein standards. The gel slice elution procedure described 
here allows recovery of 1-10% of the protein of interest. 
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Example 3 

_ Identification of the PCR Enhancing 
Activity in P. furiosus Partially-Purlfied Column Fractions 

The fractions collected after the heparin sepharose chromatography were 
analyzed for PEF activity using the PCR screening assay (Example 1). The 
addition of diluted heparin sepharose fraction dramatically increased yields of PCR 
products generated with cloned Pfu DNA polymerase. The PCR enhancing activity 
of the fractions was shown to be dependent upon the presence of exogenous DNA 
template (Figure 1) and Pfu DNA polymerase (Figure 2). Increased PCR product 
yield was, therefore, not due to the presence of contaminating DNA template or 
native Pfu DNA polymerase, but rather to the presence of PEFs. 

In order to further characterize the factor or factors responsible, the 
following was performed. PEFs after heparin sepharose chromatography were 
identified by screening SDS-PAGE gei-purified samples for PCR enhancing activity, 
as discussed above in Example 2. When the protein samples were loaded onto 
SDS-PAGE gels without pre-boiling, PCR enhancing activity (Figure 3) was 
recovered in 2 gel slices from the gel of Figure 4. One gel slice (gel slice #1 ) was 
excised from a position between the 42 and 60lcD markers, while the second gel 
slice (slice #2) was recovered from a site just above the 250kD marker (Figure 4). 

The proteins eluted from the gel slices were also screened for DNA 
polymerase activity to demonstrate that PCR enhancing activity was not related to 
contaminating DNA polymerase activity (Figure 5). The results indicated that SDS- 
PAGE purified proteins with PCR enhancing activity lack significant DNA 
polymerase activity. Moreover, SDS-PAGE purified Pfu DNA polymerase lacks 
PCR enhancing activity when tested at protein concentrations comparable to or 
greater than those of gel-purified PEFs. 

The P. furiosus PEF proteins were conclusively identified by re-analyzing gel 
purified samples with PCR enhancing activity on silver stained SDS-PAGE gels. In 
the absence of pre-boiling, the predominant band in gel slice #2 migrates somewhat 
slower than the 250kD molecular weight mari<er, consistent with the site where the 
gel slice was recovered (Figure 6). This band is called P300 or PEF complex. In 
addition, a minor band is evident at 50kD, called P50. Gel slice #1 contained a 
major band at 50kD and a minor band at 45kD. 
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However. when the proteins eluted from gel slice #2 are boiled in SDS prior 
to loading, the predominant component migrates with an apparent molecular weight 
of 50kD (Figure 6). A minor or poorly staining component of =45kD is also visible. 
These results are consistent with P. furiosus PEF consisting of two distinct proteins, 
with apparent molecular weights of 50kD and 45kD. which aggregate at low 
temperatures in presence of SDS to produce a complex which migrates as a 
discrete band at 300kD. 

Example 4 

Characterization of PEF Complex and Protein Components of the Complex 

P. furiosus PEF fractions following S200 chromatography comprises a 
mixture of proteins. A discrete band in SDS-PAGE migrates above the 250kD 
marker when the sample is not heated prior to electrophoresis. This protein is 
called P300 or PEF complex. When the conditions are changed to substantially 
dissociate the PEF complex, the subunit protein components of the PEF complex 
are visualized in SDS-PAGE. 

1. Temperature-dependent Dissociation of PEF Complex 
One method of dissociating PEF complex into its component proteins is heat 
treatment. In the absence of heating, the majority of PEF migrates as a complex, 
running slightly slower than the 2S0kD molecular weight marker. Minor amounts of 
dissociated P50 and P45 are visible in the unhealed sample. However, after heat 
treatment at temperatures of about 85°C or higher, the PEF complex is completely 
dissociated as indicated by the absence of the 300kD band by SDS-PAGE. The 
predominant protein component of PEF complex, as visualized by silver-staining, 
exhibits an apparent molecular weight of approximately 50kD in SDS-PAGE. The 
P50 band is shown in the gel of Figure 7. where the protein was heated to >85'C in 
the presence of 2% SDS and BME prior to loading. In addition, the P45 protein 
shown to be present in the gel-purified PEF complex (Figure 6) is also evident in the 
heat-treated, S200-purified PEF sample (Figure 7). Furthennore, minor 
components with apparent molecular weights of approximately 37. 42, 55, 60, 85, 
100, and 150kD were also detected in SDS-PAGE analyses of PEF complex 
treated at temperatures of 85-1 OOX. These minor components may represent 
additional forms of P50 and P45 generated by heat treatment (e.g. dimers, trimers, 
fragments) or minor unrelated species. 
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2. Polymerase Enhancing Activity of PEF Complex and 
Component Proteins and Mixtures 

Protein components of S200-purified PEF were purified by SDS-PAGE and 
the PGR enhancing titer was determined by adding serial dilutions of each gel slice 
eluate to PGR reactions with cloned Pfu DNA polymerase PGR reactions. The 
protein or protein mixtures which exhibited the highest levels of polymerase 
enhancing activity were identified by running the eluates on silver stained SDS- 
PAGE gels. Analysis of SDS-PAGE gel slice eluates indicates that PGR enhancing 
activity of S200-purified P. furiosus PEF can be attributed solely to the 45kD 
species plus the 50kD species. When an S200-purified preparation was loaded in 
the absence of heating (Figure 7. prep. 1 ). PGR enhancing activity was present in 
gel slices recovered just above the 250kD marker and between the 42 and 60kD 
markers. When heated to 85»G before loading, PGR enhancing activity migrated 
between the 42 and 60kD markers (Figure 8). 

Titration experiments showed that the PGR enhancing titer of gel purified 
proteins migrating with apparent mass between 42kD and 60kD was not 
significantly different from that of the gel-purified PEF complex (300kD band). The 
levels of PGR enhancing activity migrating between the 148 and 60kD mariners 
were insignificant, thereby indicating that the 85kD. lOOkD. and 150kD bands do 
not contribute substantially to full PGR enhancing activity of the PEF complex 
(P300). Moreover, these components do not appear to further enhance the activity 
of PEF. 

Figure 9 shows the proteins recovered from 4 gel slices between the 42 and 
60kD mari<ers from 2 heated treated PEF samples. The greatest PGR 
enhancements were observed for protein samples run in lanes 5. 9, and 10. These 
lanes contained the highest amounts of P45, in addition to low but detectable 
amounts of P50. Relative P50 concentration did not necessarily con-elate with 
highest PGR enhancing titer, as the proteins mn in lane 4 (where only P50 is 
visible) exhibited a lower titer than the protein mixture in lane 5 (same amount of 
P50. plus P45). Moreover, samples in lanes 9 and 10 exhibited similarty high PGR 
enhancing titers and levels of P45. but the sample in lane 9 contained 10 to 1000- 
fold more P50 than the sample in lane 10. These results are consistent with P45 
being the most active component of the PEF complex. Since all samples of gel 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCTAJS98/05497 



•31- 

purified P45 isolated contain varying concentrations of P50, the exact biochemical 
role a low concentration of P50 has on attaining full PEF activity or stability could 
not be determined by this method. 

Example 5 

Amino Acid Analysis of PEF Complex and PSO and P4S Components 

The complex and the predominate 50kD component (PSO) and 45kD (P45) 
component from Pfu were sequenced at the N-terminus. In addition. N-terminal 
sequence analysis was performed on the minor 100 and 150kD components 
generated upon heat dissociation. Two analyses were performed. In the first 
study, heparin sepharose-purified PEF samples (=20% homogeneous; prep. 4 in 
Figure 10) were eiectrophoresed and electroblotted onto PDVF filters. Samples 
were loaded onto 4-20% SDS-PAGE gels at room temperature or after heat 
treatment, to allow recovery of both the 50kD protein and the >250kD complex 
(Figure 1 1 ). Blots were sent to Wistar Sequencing Facility (Philadelphia. PA) for 
analysis. N-tenminal sequence analysis was performed on both the SOkD (PSO) 
protein (heated SDS-PAGE sample) and the >250kD PEF complex (unheated SDS- 
PAGE sample). 

The N-temninal amino acid sequences of the PEF complex (P300) and the 
SOkD component (Pfu PSO) were found to be substantially identical (Table A). This 
data confirmed that Pfu PSO is the predominant component of the PEF complex. 
Two distinct sequences were found for both P300 and PSO (r and 2°), suggesting 
that Pfu PEF may contain 2 different SOkD species which co-migrate, or that the 
PVDF strip containing the SOkD species was contaminated with the 45kD species or 
other species visible by SDS-PAGE. 

In addition to the N-terminal sequencing, the SOkD protein was also subject 
to in situ trypsin digestion and microbore reverse HPLC. A subset of tryptic 
peptides was analyzed by mass spec. Two peptides with single masses (#107, 
#112) and one peptide with two masses (#108) were chosen for sequence analysis. 
Two internal peptide sequences from Pfu PSO were recovered (Table A; Tryptic 
Peptides). Peptide #1 12 was 24 amino acids in length and the calculated mass of 
the Edman sequence (2S30.8) was in very good agreement with the observ d 
peptide mass (2S31.7). Peptides #107 and #108 contained multiple sequences 
which could not be sorted by Edman sequencing alone. However these peptide 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCT/US98/05497 



-32- 

fractions eluted very close together on microbore HPLC and contained several 
residues in common. Based upon shared sequence and mass analysis, a tentative 
sequence was assigned (107/108; Table A). 
Table A - Analysis 1. 
N"terminal Sequencing 

Protein Mass (Pa) seaufinrp/rsEo th no- ) 

Sample Obs . 

PEF complex «50,000 V XLLHH VKLIY ATXXfi (i) 

2' XXXPD WXXRX EXLXX (2) 

P50 »50,000 V iiLLHH VKLIY ATK£R (3) 

BLVGK KIVLA IPGXI AAVEp 

2' XXXPD W£XRX EXLfiE ££1 (4) 

Tfyptic Peptides 

ffifitidfi Mass fPfl) Sequence/ L^T -O ID m-. ) 

Comments 

Qbs. Calc. 

107 1389.59 N/A 

multiple sequences 108 1659.1,1910.63 N/A 

multiple sequences 
107/108 - 1910.3 KVDAV IMAAA VVDFR PK (5) 

AAs common to 107/108 

112 2531.73 2530.8 ADLVV GNTLE AFGSE (6) 

ENOW LIGR 

"X" represents any amino acid 

underlined residues represent amino acids that may be deleted or 
substituted with any amino acid but are tentatively assigned as indicated 

The 35 amino acid sequence recovered from the N-terminus of Pfu P50 
(SEQ ID NO.: 3). and the two internal peptides of 17 and 24 amino acids (SEQ ID 
NO.:s 5 and 6), represent approximately 16% of the total amino acid sequence of 
Pfu P50, assuming an apparent molecular weight of 50kD and a length of 
approximately 454 amino acids. 

In the second round of analyses, the N-terminal sequences of the 150, 100, 
50. 45. and 42kD species were determined from a PVDF blot of heated S200- 
purified PEF (Figure 12 and Table B). This analysis was performed by the 
Beckman Research Institute of the City of Hope (Duarte, CA). The N-terminal 
sequences of the 150 and lOOkD species w re the same and identical to the major 
sequence in the 50kD band (except for ambiguity at the N-terminus) and similar to 
the minor sequence in the 45kD band. In addition, the sequence was very similar to 
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the major sequence in the PEF complex and the 50kD band detemiined in the 
Wistar sequence analysis. The data are consistent with the 150 and lOOkD species 
being alternative forms of the 50kD species (e.g.. dimers. trimers. or aggregates). 
The major N-temiinal sequence of the 45kD band (2 blot sections analyzed; "upper 
and -lower") was distinct from the P50 sequence and very similar to the minor 
sequence found in the 50kD bands analyzed by both Wistar and Beckman and in 
the PEF complex analyzed by Wistar. No N-temninal sequence was recovered for 
the 42kD species. In total, these data are consistent with the PEF complex 
consisting of 2 distinct protein components, P50 and PAS. 
Table B - Analysis 2. 

Protein fiample Sequence 
(SEQ ID KO) 

(MW) 

^^^^^ (GAM)LHHV KLIYA JKLRK (7) 

^^^^^ (GAM)LHHV FCLIYA TK(ia)RK (8) 

5^ 1' M LHHV KLIYA TKL (9) 

2* GL(ia) PD W(WK) (KDRK EES (10) 

45 (upper) V (£AI)LLPD WKIRK EILIE {11} 

2' XMHH{VI) KLXYA TXSRK (12) 

45 (lower) V M(LY) (LV) (RP) D WKRRK EILIE (13) 

42 no sequence 

X represents any amino acid; 

underlined residues represent amino acids that may be substituted with any 
amino acid but are assigned as indicated; 

amino acids within parenthesis ( ) indicate that one of the enclosed amino 
acids is present at that site. 

Using sequence information stored in a computer readable medium, one 
skilled in the art can perform computer-implemented homology searches. Here, the 
nonredundant GenBankCDS translations+PDB+SwissProt+SPupdate+PIR protein 
databases, using BLASTp, indicated that the partial amino acid sequence of Pfu 
P50 and P45 do not exhibit identity to any protein in those databases. 

Example 6 

Nucleotide and Predicted Amino Acid Sequence of P50 
The nucleotide sequence of the P50 protein component was obtained by 
cloning the Pfu P50 using standard techniques. 
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1. Library Screening 

A Pyrococcus furiosus genomic library was plated on XL1-Blue MRF £ coli 
at a density of approximately 2000 plaques per plate. Duralose filters (nitrocellulose 
on a nylon backing) were used to take replicate lifts from each plate. While the first 
filter was on the plate, orientation marks were made by stabbing a needle through 
the filter and into the plate. The orientation marks were marked in pen on the back 
of the plate before the filter was removed. The filter lifts were treated as follows: 
1 .5-2.0 minutes 1 .5 M NaCI, 0.5M NaOH 

2 minutes 0.5 M Tris (pH 8.0), 1 .5 M NaCI 

30 seconds 2xSSC. 0.2M Tris (pH 7.5) 

After treatment, the filters were partially dried until they were still damp, but no 
standing water was visible. The DNA on the filters was fixed by UV crosslinking 
with the Stratalinker set to the "Autolink" fonnat. 
The filters were prehybridized in 15 ml of: 
5x SSC 

40 mM NaP04 pH (6.5) 
5x Denhardt's 
5% Dextran Sulfate 
50% Fonnamide 

0.1 mg/ml Salmon sperm DNA (Boiled separately and added immediately 
prior to use) 

Prehybridization was carried out at 42'*C for approximately 2 hours. 

Probe was generated from the 900bp PGR product amplified from Pfu 
genomic DNA with the following degenerate primers: 

Oligo #50: CAT CAT GAA AAA CTA ATT TAG GC (SEQ ID NO- 14) 

C C C G T C 
T T 

Oligo «61: GC CAT AAT TAC TGC ATC GTA TTT (SEO ID NO: 15) 

G C G G 
T G A 
A 

Oligo #50 was designed to hybridize to DNA encoding the HHVKLIYA (SEQ ID NO.: 
66) peptide in SEQ ID NO.: 1 . at the N-terminus of P50, while oligo #61 was 
designed to hybridize to the antisense DNA strand encoding the peptide 
KYDAVIMA (SEQ ID NO.: 67) in SEQ ID NO.: 5. 
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The PCR product was purified from free primers, buffer and nucleotides and 
50 ng was labeled with -P-adATP using the Stratagene Prime-It II Random Primer 
Labelling kit. The probe was purified from free nucleotides before being boiled for 
five minutes and added to the prehybridizatlon reaction. The total probe was 
roughly calculated to be 80 million cpm. 

Hybridization was allowed to continue overnight at 42»C before the 
hybridization solution was removed and the filters were washed four times with 0.1x 
SSC. 0.1% SDS at 60»C (stringent conditions). 

The filters were exposed to X-ray film ovemight and 20 primary isolates, with 
strong signals on both replicate filters, were picked. 

Six primary isolates were diluted, plated, and screened again using the 
same method described above. Of the six. three filters produced isolated lambda 
clones. The clones were confirmed by PCR amplification using the degenerate 
primers. All clones were able to produce the 900 bp amplified product with oligos 
#50 and #61, which was used as probe. Clones 6A and 3B produced a 1200 bp 
amplified fragment with oligos #54 and #58. Oligo #54 was designed to hybridize to 
DNA encoding the HHVKLIYA (SEQ ID NO: 66) peptide in SEQ ID NO: 1. and oligo 
#58 was designed to hybridize to the antisense DNA strand encoding the 
EENQWL (SEQ ID NO.: 68) peptide of SEQ ID NO.: 6. Clone 6D only produced a 
900 bp amplified product. 

Oligo #54: CAT CAT GAA AAA CTA ATA TAG GC (SEQ ID NO- 16) 

C C C G T C 
T T 

Oligo #58: AG TAC TAG TTG ATT TTC TTG (SEQ ID NO- 17) 

A G G C G 
A A 

Bluescript plasmid was excised from the lambda clones in SOLR cells and the 
presence of inserts confirmed again by PCR amplification of the 1200 or 900bp 
product. 

2. DNA Sequencing 

Sequencing was carried out on purified PCR products and plasmid mini- 
preps made from the excised cells. The nucleotide sequence is listed below with 
the predicted protein translation. The peptide sequences used to generate the 
probes are indicated by underiining. "N" repres nts any base and "X" represents 
any amino acid. 
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(SEQIDN0:18) 

ATGCTTCACCACGTCAAGCTAATCTACGCCACAAAAAGTCGAAAGCTAGT 

TGGAAAAAAGATAGTCNNNNNNNNNCCAGGGAGTATTGCGGCTTTGGATG 

TGAAAGCTTGTGAGGGACTAATTAGGCATGGGGCCGAAGTTCATGCAGTG 

ATGAGTGAGGCAGCCACCAAGATAATTCATCCTTATGCATGGAATTTGCC 

CACGGGAAATCCA6TCATAACTGAGATCACTGGATTTATCGAGCATGTTG 

AGTTAGCAGGGGAACATGAGAATAAAGCAGATTTAATTTTGGTTTGTCCT 

GCCACTGCCAACACAATTAGTAAGATTGCATGTGGAATAGATGATACTCC 

AGTAACTACAGTCGTGACCACAGCATTTCCCCACATTCCAATTATGATAG 

CCCCAGCAATGCATGAGACAATGTACAGGCATCCCATAGTAAGGGAGAAC 

ATTGAAAGGTTAAAGAAGCTTGGCGTTGAGTTTATAGGACCAAGAATTGA 

GGAGGGAAAGGCAAAAGTTGCAAGCATTGATGAAATAGTTTACAGAGTTA 

TTAAAAAGCTCCACAAAAAAACATTGGAAGGGAAGAGAGTCCTAGTAACG 

GCGGGAGCAACAAGAGAGTACATAGATCCAATAAGATTCATAACAAATGC 

CAGCAGTGGAAAAATGGGAGTAGCGTTGGCTGAAGAAGCAGATTTTAGAG 

GAGCTGTTACCCTCATAAGAACAAAGGGAAGTGTAAAGGCTTTTAGAATC 

AGAAAAATCAAATTGAAGGTTGAGACAGTGGAAGAAATGCTTTCAGCGAT 

TGAAAATGAGTTGAGGAGTAAAAAGTATGACGTAGTTATTJiTGGCAGCTG 

CTGTAAGCGATTTTAGGCCAAAAATTAAAGCAGAGGGAAAAATTAAAAGC 

GGAAGATCAATAACGATAGAGCTCGTTCCNNNNAATCCCAAAATCATTGA 

TAGAATAAAGGAAATTCAACCAAATGTCTTTCTTGTTGGATTTAAAGCAG 

AAACTTCAAAAGAAAAGCTTATAGAAGAAGGTAAAAGGCAGATTGAGAGG 

GCCAAGGCTGACTTAGTCGTTGGTAACACATTGGAAGCCTTTGGAAGCGA 

GGAAAACCAAGTAGTATTAATTGGCAGAGATTTCACAAAAGAACTTCCAA 

AAATGAAAAAGAGAGAGTTAGCAGAGAGAATTTGGGATGAGATAGAGAAA 
TTNCTGTCC 

Pyrococcus furiosus P50 predicted amino acid sequence: 
(SEQ ID NO: 19) 

MUiHVKLIYATKSRKLVCTarrvy^fl^^XftftT n;;T^^r^p^T T RHGAEVHAV 
MSEAATKIIHPyAWNLPTGNPVITEITGFIEHVELAGEHENKADLILVCP 
ATANTISKIACGIDDTPVTTWTTAFPHIPIMIAPAMHETMYRHPIVREN 
lERLKKLGVEFIGPRIEEGKAKVASIDEIVYRVIKKLHKKTLEGKRVLVT 
AGATREYIDPIRFITNASSGKMGVALAEEADFRGAVTLIRTKGSVKAFRI 
RKI KLKVETVEEMLSAI ENELRS K KYDWIM^AAVfinP ^ py T KAEGKI KS 

GRSITIELVPXNPKIIDRIKEIOPNVFLVGFKAETSKEKLIEEGKRQIER 
AKADLWGNTLEAFrrsPrMnWT.ypfpnrTvrT pym^^ppT f^mnrCICi: 
XLS 

Translated sequence corresponding to chemically-determined N- 
terminal sequence (SEQ ID NO.: 3): 

MLHHVKLIYATKSRKLVGKKIVXXXPGSIAA {SEQ ID NO: 46) 
Translated sequences corresponding chemically-determined internal 
peptide sequences (SEQ ID NOs.: 5 and 6): 

KYDWIMAAAVSDFRPK (SEQ ID NO* 47) 

ADLWGNTLEAFGSEENQWLIGR (SEQ ID NO: 48) 

The protein has a theoretical pi of 9.36 and a theoretical MW of 44801.29. 

There are inconsistencies between the chemically-determined AA sequence 
of P50 and the AA sequence derived from the P. furiosus P50 genomic clone. One 
Skilled in the art is familiar with many reasons for this type of inconsistency. For 
example, the inconsistencies below can, largely, be explained by known limitations 
common to the procedures used. These limitations do not operate to limit the 
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structural knowledge of proteins or nucleic acids. Instead, they merely indicate 
possible variations in the sequences of amino acids or nucleic adds by a finite 
number. 

Some of the inconsistencies and explanations are: 

£5LLHHVKLIYA TKfiRfi LVGKiJ IVLAI PGXIA ME£ (Table A; SEQ ID NO:s 1 
and 3) 

MLHHV KLIYA TKL (Table B; SEQ ID NO: 9) 

MLHHV KLIYA TKSRK LVGKK IVLAI PGSIA ALDV (predicted sequence from 
SEQ ID N0»:19) 

The inconsistency in Table A sequence at cycle 2 (extra AA between AA1 and 2) 
may be due to contamination with P45, which appears to have L*s at positions 2 
and 3. Moreover, L at cycle 2 in SEQ ID NO: 1 was assigned tentatively. Other 
inconsistencies between the Table A sequence and the predicted sequence occur 
at AA 15 (R vs. K) and AA 32-34 (VEP vs. LDV). 

An inconsistency between the Table B sequence and the predicted 
sequence was found at AA1 3. The identification of AA13 as L instead of S is 
explained by the poor recovery of S in chemical sequencing and the contamination 
of P50 with low amounts of P45, which has a L at that position. 

ADLW GNTLE AFGSE ENQW LIGR (Table A; SEQ ID NO- 6) 

ADLVV GNTLE AFGSE ENQW LIGR (predicted sequence * from 
SEQ ID NO. : 19) 

S)! tTable A; SEQ ID NO: 5) 

sS ?D JT^Jr^'' (predicted sequence from 

SEQ ID NO: 6, determined chemically from a P50 tryptic peptide, was identical to a 
24 AA sequence translated from the P50 DNA sequence. For SEQ ID NO: 5. there 
were 2 inconsistencies found between the chemical and DNA sequences. An A 
was recovered at cycle 4 instead of a V, and a V was recovered at cycle 12 instead 
of a S. The inconsistencies may be due to the difficulties associated with 
interpreting sequences from a sample that is not absolutely pure. 

Example 7 

Identification of Proteins Related to P50: Similarity to Exoli DFP Flavoprotein 
1. DNA Sequence Similarity of Pfu P50 Protein to Archael and 
Bacterial Proteins 

The DNA sequence of a P50 clone exhibits very strong homology to the 
flavoprotein DFP, a protein identified in £ co// as playing a role in DNA synthesis 
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and pantothenate metabolism (Spitzer and Weiss, J. Bacteriol. 164:994-1003 
(1985) and Spitzer, Jimenez-Blllini. and Weiss. J. Bacteriol. 170:872-876 (1988)). 
Although DFP was found to be an essential gene for DNA replication, these authors 
were not able to elucidate its role in DNA replication. The sequences in Table 1 (N- 
5 temiinal 1 ' sequence, tryptic peptides 107/108 and 1 12) are all found in the 

translated P50 clone, which exhibits very high similarity to DFP. Accordingly, the 
P50 amino acid and DNA sequence information can be used to identify related 
proteins associated with PEP complexes from other sources such as bacteria, 
eukaryotes, and archae. 
^ ° The amino acid sequences of Methanococcus jannaschii (Mja) and £ coli 

dfp proteins support the identification of the protein designated P50 (Pfu) as a 
member of the dfp family of proteins. The three protein sequences were compared 
using ClustalW 1.6. with the comparison data represented below. 



i nil HllSiSi SJlJESSISiS! JjyjJiSSf;^ -KACEGL1RHCA£VH AVHSCWTKUHPYA -KLPTCKPVITtlTC 7e 

I H3« — HISCJMHP TKUKGTKSKLLOIK KILVXVTSSIAMCT PKLKRCLIRHCAEVY CIlTCtTKKIIGHlA LKrcceiirvvprTTr ai 

folo zT «CHPTCKiiMsu« wvwvsGciAAm ni^u^cAsZ ^^l^SSiirns 12I5K??SSi5 II 

1" "0 121 133 136 150 151 US 166 iBfi 

1 PIU riEHVELAC£ HEKKWLILVCPATA KTISKIACCIDDTW TTWTTArPHIPlMI APAMHtTMYRMPIVR EMlMLK-KLCuirT 

^^S^t.COli PAAEAAHCHIEU:-. "KUADIVILAPATA DLlAAVAAfiMANOLV STICI^TP-ApiA" lJ!!S5S!t^ SSJiJSJSSIJi \n 



0^ f 210 211 225 226 240 241 »«c itr. 

1 ^ KX'ASlOCimviKK LHKKTLt-CKRVlVT AGATMYIDPIRFIT KASSCKKGVAUXCA DrRGAVTlIRTKCSV 245 

2 M)e SPKra CKA KVAKlEDWKAVItK ICKKLKKrOTRVia HCCTVEriDKVRVlS NLSSGWWVALAEAT aaGFYVEVtrlScL Itl 

i" ""'^O*^^"*^"* MXDPITIVWAVAH rSPVHDLKHlKIMlT ACPTWPLOmYIS NMslSSJiSiSI wSS^il^S^Ss III 

^n ^"'^ 286 300 301 315 316 330 ail i** Mt ,cr 

J SJJJfSiXSISS; Srr!!Xtr!S2' •"'«oki«sgr.... .-.tielvfxm«" S; niorK-vrJJc J30 

2 njfl EPPYYIKHHmTAK CMLHKAIE—L-AKD roillSSAAlSDPrv ES- rtCIOSSEEE" --LILKLKB-KPKVL ttLRSIYKD-KVI ir 
9.1.\T LOCAAVH-ASVOO ONIFICCAAVADYRA ATVAPiJTSo"oc OEtJ^SJU-UfSIi S!;S[!!5SKf{ic III 

390 391 405 406 420 421 tl*. 11* -i^ 

1 P'u FKAETSK-EK-LIEE CKAOlEAAKADtWC KTL CAfCSEEN OWllCRDrTKELPK MKKRElJU-RTMPr Tf wt\ I^, 

2 Hi. rXAEYKlOCKELlHR AKESLHKYNUWl lA MOLSK-HYrCDOYI ivYI m^KIsS SS^E^! ^ Sws"!: \l\ 

3 E.coU FAAETW VEtY AAOKRIRKNLOLICA KDVSOPTQCfKSDIlN AtHlJiSLpI ?SE"c5f[ISl[5 tITAksH OO 

From the above comparison, it would be apparent to one of skill in the art 
that related proteins from other species can be identified and isolated by methods 

40 known in the art. The example above employed stringent screening conditions. 

Less stringent conditions, varying the concentration of salts, detergent, or the 
temperature during hybridization or washing, as known in the art, would lead to 
related clones from libraries containing sequences of any of a number of species. 
For example, in addition to the conditions described above, any of the following 

45 hybridization conditions can be used, in any combination, in methods to isolate DNA 

sequences related to the P50 or P45 sequences herein: 
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low stringency wash in a solution comprising approx. .45 M NaCI, approx. 
.045 M trisodium citrate, and approx. .1% SDS. at approx. 37^ to approx. 
42"C; 

hybridization buffer comprising approx. .75M NaCI, approx. .15 M Tris, 
approx. 10 mM sodium pyrophosphate, approx. .075 M trisodium citrate, 
and approx. 50% formamide; 

hybridization buffer comprising approx. 5 x SSC, approx. 5x Denhardt's, 
approx. 5% Dextran Sulfate, approx. 50% formamide, and approx. .1 mg/ml 
ssDNA; 

hybridization wash comprising approx. .1 M phosphate, approx. ,1 x SET. 

approx. .1% sodium pyrophosphate, and approx. .1% SDS at approx. 45*C. 

2, Absorbance Spectrum of Purified P. furiosus PEF Complex 

The absortsance spectrum of purified P. furiosus PEF complex reveals two 
peaks of absorbance at 370 and 450 nm. Figure 13 depicts the absorbance 
spectrum of S-200 purified P. furiosus PEF. These data indicate and are consistent 
with the PEF complex comprising at least one flavoprotein. Sequencing data also 
verifies the identification of P. furiosus P50 as a homolog of the £ coli DFP protein. 
£ CO// DFP is a flavoprotein containing a non-covalently associated FMN moiety. 

Up to this point, flavoproteins have not been directly implicated as part of the 
replication machinery. The potential involvement of a flavoprotein in PGR 
enhancement suggests a role for redox reactions. The only redox reaction involved 
in DNA synthesis is the formation of deoxyribonucleotides from ribonucleotides, 
which is catalyzed by ribonucleoside diphosphate reductase. In vitro, the 
ribonucleoside diphosphate reductase enzyme can be coupled to NADPH via two 
known pathways involving FAD-containing oxidoreductases (Pigiet and Conley, J. 
Biol. Chem. 252:6367-72 (1977); Thelander and Reichard, Ann. Rev. Biochem. 
48:133-158 (1979)). One pathway involves thioredoxin and thioredoxin reductase. 
Interestingly, £ co// thioredoxin has been shown to act as an accessory protein and 
confer processivity on T7 DNA polymerase. An alternate hypothesis for the 
potential role of a flavoprotein in PGR enhancement is that a flavoprotein may be 
required for the continuous processing or activation of other proteins or cofactors 
n eded for nucleic acid replication. 
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Nucleotide and Amino Acid Sequence of P45 

The nucleotide sequence of the Pfu P45 protein component was obtained as 
described below. 

1 . Synthesis of P4S Sequences 

Amino temiinal peptide sequencing of purified P45 protein allowed the 

generation of four degenerate oligonucleotides designed to hybridize to DNA 

encoding the PDWKIRKE (SEQ ID NO.:69) peptide of SEQ ID NO.: 11 , as follows: 

#743: CCA GAC TGG AAA ATA AGG AAA GA (SEQ ID NO- 32) 
T G TGG 

C 
T 

#744: CCA GAC TGG AAA ATA AGA AAA GA (SEQ ID NO- 33) 
T G TGG 

C 
T 

#745: CCA GAC TGG AAA ATA AGG AAG GA (SEQ ID NO- 34) 
T G TGG 

C 
T 

#746: CCA GAC TGG AAA ATA AGA AAG GA (SEQ ID NO: 35) 
T G TGG 

C 
T 

A lambda phage P. furiosus genomic library was used as template for PGR 
amplification of the P45 sequence. The four degenerate oligonucleotides were 
used in separate reactions to prime template in one direction, in combination with 
one of the primers that border the genomic insertion of the lambda vector (-20 
primer and reverse primer). The PGR reaction was earned out as specified below: 

10 pi lOx Stratagene cloned Pfu buffer 

5 Ml degenerate P45 primer (either # 743, 744, 745 or 746) at 100 ng/pl 

2.0 Mi either reverse or -20 primer (100 ng/Ml) 

0.8 Ml 100 mM dNTP (total dNTPs) 

0.5 Ml Taq DNA polymerase (Stratagene. 5u/mI) 

0.5 Ml Taq Extender (Stratagene, 5u/mI) 

3.0 Ml Pfc genomic library (-1.2 x 10*** plaque fonning units/ml) 

78.2 Ml HjO 
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PGR cycling was carried out in a RoboGradient temperature cycler as follows: One 
cycle at 95«C for 3 minutes, followed by 30 cycles of: 95'C for 1 minute; 51»C to 
eS'C gradient for 2 minutes; 72'C for 6 minutes. 

The PGR products were separated on a 1% agarose. lx TBE gel. All primer 
combinations produced multiple bands. A pattem of four bands was consistently 
seen with primers 743. 744. and 746 in conjunction with the -20 primer. The three 
degenerate primers that formed consistent four band patterns with the -20 primer 
were able to generate the pattem at 56«C. Only primer 743 could generate the 
pattem at SS-G. The band pattem produced with the degenerate primers in 
combination with the reverse primers was less distinct and fornied only at lower 
annealing temperatures than the products generated with the -20 primer. 

2. Cloning Strategies 

Two strategies were used to isolate the P45 clone. One procedure was to 
make simplified sub-libraries of the original highly complex library and screen for an 
insert with the -20 and 743 primers. Positive sub-libraries could be diluted and 
rescreened until individual plaques containing the appropriate insert were identified. 
The other technique was to make use of VectoretteT»« technology (Genosys 
Biotechnologies), which allows PGR amplification when the sequence of only one 
end of a DNA fragment is known. In the veclorette system, genomic DNA is 
digested with a selection of specific restriction endonucleases. After digestion, the 
ends of the genomic DNA are ligated to specific vectorette units, which have the 
same cohesive termini as the genomic DNA digestion. The ligated vectorette unit 
contains a sequence complimentary to a provided vectorette PGR primer. (Arnold 
and Hodgson, PGR Methods and Applications 1: 39-42 (1991).) 

3. The Vectorette Reaction 

Fifty Ml reactions containing 100 ng of P. furiosus genomic DNA were 
digested with £co Rl, Hind III and Bsp 1061 (an isoschizomer for Cla I) in their 
recommended buffers for one hour at 37«G. Without any post-reaction treatment. 1 
Ml of the appropriate vectorette unit (Hind III, Cla I or £co Rl at 3 pmole/pl) was 
added with 6 pi oflO mM ATP. 1 mI of 100 mM DTT and 1 unit of T4 DNA ligase 
(Stratagene 4 u/pl). The reaction was cycled at the following temperatures: 20''C 
for 60 minute followed by 37»G for 30 minutes for 3 cycl s. 
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The ligated DNA was amplified according to the following: 

10 pi cloned lOx Stratagene Pfu buffer 

8.3 Ml degenerate P45 primer at 100 ng/pl 

2.0 pi 50 pmol/pl vedorette primer 

0.8 pIlOO mM (total) <m? 

0.5 pi Taq DNA polymerase (Stratagene. 5u/pl) 

0.5 pi Taq Extender (Stratagene. 5u/pl) 

1.0pl vedorette library 

76.9 pi HjO 

PGR cycling was carried out as follows: One cylce at 95»C for 1 minute followed by 
30 cycles of: SS'C for 1 minute; 56»C for 2 minutes; and 72»C for 3 minutes. 

Ten pi were loaded on an 1% agarose. 1x TBE gel. Multiple bands were 
produced by all primers except 745. To determine if all three vedorette library 
products had been correctly primed off the same target DNA (P45 sequence) rather 
than having been produced by a non-specific PGR reaction, the products were 
digested with Mnl I. Mnl I deaves at a frequent four base pair recognition sequence 
and produces a useful pattern of bands spedfic to the template digested. The 
pattern generated by eledrophoresis of the Mnl I digestion fragments of the C/a 
i/743, H/ncf 111/744 and Eco RI/744 PGR produds on a 6% acrylamide gel showed 
some variation, but the majority of bands could be identified in all three samples, 
indicating that they share large segments of identical sequence. 

4. Screening 

The PGR products from the Gla 1/743 and Hind 111/744 combinations were 
mixed and purified from free nucleotides and unused primers before being used as 
template for the generation of a 52 million cpm «P labeled probe. Details on probe 
synthesis and library screening are cited in Example 6. 

More than 60 positive dones resulted from screening with the mixed 
vedorette probe. Several positive were well situated for colledion without 
significant contamination from adjoining plaques. Twelve of these plaques were 
subjected to PGR amplification with the 743 and -20 primer as described previously 
except that an annealing temperature of Se'G was used instead of a temperature 
gradient. In the same amplification assay. 1 1 sub-libraries were assayed in the 
same manner. 
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Three of the twelve clones recovered from the primary radioactive \abe\ 
screen produced strong, single bands. Clone 1 produced a band of approximately 
5 kb. clone 3 produced a band of approximately 3.5 kb, and done 9 generated a 
band of approximately 2.7 kb. One of the sub-libraries also produced a clone of 
approximately 6.5 kb. 

5. Sequencing 

Sequencing of the P45 clones was carried out on purified PCR products 

and plasmid mini-preps made from excised cells. The nucleotide sequence of P45 

is listed below with the predicted amino add translation. The diemically-determined 

N-temiinal sequence of P45 (SEQ ID NO.: 11). used to generate the degenerate 

PCR primers (SEQ ID NO.: 32-35). is indicated by underiining. 

P45 nucleotide jseouBnen (SEQ ID NO.: 70) 

ATGCTACTTC CAGACTGGAA AATCAGAAAA GAAATACTTA TAGAGCCATT 
TTCTGAAGAA TCGCTCCAAC CAGCAGGTTA TGACCTCAGA GTGGGCAGAG 
AGGCTTTTGT TAAGGGGAAA TTAATCGACG TGGAAAAGGA AGGAAAAGTC 
GTTATTCCTC CAAGGGAATA CGCCTTAATC CTAACCCTCG AGAGGATAAA 
GTTGCCCGAC GATGTTATGG GGGATATGAA GATAAGGAGC AGTTTAGCAA 
GAGAAGGGGT TATTGGTTCT TTTGCTTGGG TTGACCCAGG ATGGGATGGA 
AACTTAACAC TAATGCTCTA CAATGCCTCA AATGAACCTG TCGAATTAAG 
ATATGGAGAG AGATTTGTGC AGATCGCATT TATAAGGCTA GAGGGTCCGG 
CAAGAAACCC TTACAGAGGA AACTATCAGG GGAGCACAAG GTTAGCGTTT 
TCAAAGAGAA AGAAACTCTA G 

P45 amino acid seouenet^ (seq id NO.: 71 ) 

MLLPDWKTRK FTTiTF;pfsee SLQPAGYDLR vgreafvkgk LIDVEKEGKV 
VIPPREYALI LTLERIKLPD DVMGDMKIRS SLAREGVIGS FAWVDPGWDG 
NLTLMLYNAS NEPVELRYGE RFVQIAFIRL EGPARNPYRG NYQ6STRLAF 
SKRKKL* 

The translated P45 protein has a theoretical pl of 9.12 and a calculated 
molecular weight of 17868.76. The translated N-terminal sequence (underiined) 
of P45 con^sponds to the experimentally-determined sequence (SEQ ID NO.: 11) 
and matches the exact sequence (SEQ ID NO.: 60) used to design the degenerate 
PCR primers. 

When the P45 DNA sequence is translated in all six frames and compared 
to multiple sequence databases using the computer-implemented program Blasbc. 
the dCTP deaminase gene of Desulfurolobus ambivalens was found to have 
similarities. Another entirely different gen encoding polyprotein from Visna and 
MaedaWisna virus was also identified, but at a less significant level of s quence 
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similarity. Each of these similar genes, as well as those discussed below, may 
represent sequences related to P45. Thus, sequence information stored on 
computer readable medium may be used with computer based methods to search 
for homologous, similar, or identical sequences in a database to develop further 
P45 and PEF^ncoding DNA sequences in accordance with this invention, as 
shown in example 9 below. 

Example 9 
Identification of Proteins Related to P45 
1. DNA Sequence Homology of P. furiosus P45 Protein to dCTP 
Deaminase and dUTPases 

When the P45 DNA sequence was compared to multiple databases using 
the program BlastX. the probable deoxycytidine triphosphate deaminase (dCTP 
deaminase) gene (cfccO of Desulfurolobus ambivalens was found to exhibit the 
highest degree of similarity. The biochemical properties and physiological role of 
dCTP deaminase has been studied in £ co//. where dCTP deaminase is a 
homotetramer, which catalyzes the formation of dUTP and NH3 from dCTP. DNA 
sequences with similarity to the £ coli dcd gene have been discovered in other 
bacteria {Salmonella typhimurium, Haemophilus influenzae) and in archea 
(Ouzounis, C. Kyrpides, N., and Sander. C. Nucl. Acids Res. 23:565-570 (1995); 
Bult, C.J. et al Science 273:1058-1073 (1996); Beck. C.J., Eisenhardt. A.R. and 
Neuhard, J.. J, Biol. Chem. 250:609-616 (1975); Fleischmann, R,D. et aL Science 
269:496-512(1995)). Amino acid sequence comparisons between P. furiosus P45 
and the dCTP deaminases from Desulfurolobus ambivalens CA")(also known as 
Acidianus ambivalens, Sulfolobus ambivalens), £ coli CE"), and Haemophilus 
influenza are shown below, demonstrating sequence similarity ranges from about 
39.1 to about 71.2%. 
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60 61 

-EAXVK6K -LIOVtK- 



75 76 90 
" ECKVXIPPUYAI. 



MAP Multiple Sequence Alignnenc Results 
Pagt 1.1 

1 IS H 30 )1 45 46 

1 p45 HLLPoiiKiRKt — -aitprst-noop ACTDtHToa xtva 

2 A. KIUaimilJfrYLDCC WVI5PLTO-0TIM »«r/t)UVCC"HAR nWTOtlYCDCICOPR SPYEICK rTrTTv»tfrtA« 

3 E. HRLaJUDIEAMLDCC RISimPPVWIMC ATVDVRUKKmni CHrlSiDLSCProC vSIiSv^SmL M^iSIiJSJ 

4 KAfIN KIU^DItA1fU)a IISITPHPNHCKIHG ATHWRUDCSFUm tSJSnoSJS JSS!S^SIn lo 



Pe«« 3.1 

l"* 1" "1 135 U6 150 151 1S« i«& i.i^ 

; Ji2SI5'2Z2° »«?»«"«CVIC SrAH-VOPOroCKl TUarWASHtPVtl* YCtRIVOTAniuic PAWPYll----6KiJ 

2 A. LVrmrVKLPKWMA rvHlMSrAJOClfV PPTI-VOACfXCOl. TICVtC-SArPVXIIC RCTRnJ^LinilTL? PvSpyJ-.-ISvX 

3 E. AVTIXSmPACtVC -LDGRSSLARLCIKV KVTAHAIOPCMSOCl VLEnSsSIpLAlI pSMLlSasSS SSIfSiiSS? 

4 HAEIN ATT1X5VKLPAHIXC WUBMSLAJlLGlilV KVTAHRIDPCWCKI VMPffiSCKLPUOJl pSS^ SuJiTvSg 



9*99 3.1 

1 p45 

3 A. 
3 C. 



IBI 195 196 

CSTRLArsntXH,-- 162 

coocvTLPKntrn— na 

N00GAVA5RIDXD-- 193 

4 KAIIM MOOSAVASRIDtDKE 195 



210 311 



225 226 



E. CO/I DCD exhibits an apparent molecular weight of 21.2kD (Wang, L. and 
Weiss, B. J. Bacteriol. 174:5647-5653 (1992)), while the predicted molecular weight 
of M. jannaschii DCD is approximately 22kD (204 amino acids). These molecular 
weights are approximately half the apparent molecular weight of P. furiosus P45 
and suggest that the heat-dissociated form of P45 (apparent moi. weight of 45kD) 
may actually be a dimer. 

In addition to dCTP deaminase, P. furiosus P45 exhibits a significant, but 
lower degree of sequence similarity to uridine triphosphatase (dUTPase), 
dUTPase, an enzyme encoded by the dut gene, converts dUTP to pyrophosphate 
and dUMP. An amino sequence alignment comparing the sequence of P45 to 
several dUTPases is shown below. Regions of identity are shown in black boxes, 
while regions of similarity are shown in grey boxes. 
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^7. 

One of the regions of sequence similarity between P45 and dUTPase is the 
putative uridine-binding motif. This motif is conserved in the translated amino acid 
sequence of P. furiosus P45, which reveals the presence of a putative uridine- 
binding sequence conserved in ^ synthetases. dCTP deaminases, and dUTPases 
(Koonin, E.V. Nud. Acids Res. 24:2411-2415 (1996)). In the following 
comparisons, U represents a bulky hydrophobic residue such as I, V. M, F, Y, or 
W, and the bolded residues match the G, D or U residues of the consensus 
sequence. 

Consensus uridine-binding 



10 



15 



motif: 


.GUUD. .U.G.U.U 


{SEQ 


ID 


NO. : 


72) 


P. fur. P45: 


FAWVDPGWDGNTLM 


(SEQ 


ID 


NO. : 


73) 


W, jann, DCD: 


AGWIDAGFKGKITL 


{SEQ 


ID 


NO. : 


74) 


M. jann put, DCD: 


SAVHDPGYEGRPEY 


(SEQ 


ID 


NO. : 


75) 


D. sulf. DCD: 


PTIVDAGFEGOLTI 


(SEQ 


ID 


NO. : 


76) 


£. coii DCD: 


AHRIDPGWSGCIVL 


(SEQ 


ID 


NO. : 


77) 


E. coll DUT: 


VGLIDSDYQGQLMI 


(SEQ 


ID 


NO. : 


78) 


Yeast DUT: 


AGWDRDYTGEVKV 


(SEQ 


ID 


NO. : 


79) 


Human DUT: 


AGVIDEDYRGNVCV 


(SEQ 


ID 


NO. : 


80) 


Herpes virus DUT: 


TGLIDPGFQGELKL 


(SEQ 


ID 


NO. : 


81) 



Each of these proteins represent activities, such as dUTPase, that may be 
used as a polymerase enhancing activity or PEP. One skilled in the art can identify 
numerous other proteins using stored sequence information, in the appropriate 
computer readable medium, from this disclosure and analogous searching 
procedures in other databases. Database information on each of the following 
species can specifically be used to identify PEF using one or more of the 
sequences, or parts thereof, identified herein: Pyrococcus furiosus; Pyrococcus 
horikoshii; Pyrobaculum aerophilum; Sulfolobus solfataricus; Archaeogtobus 
fulgidus; Aquifex pyrophilus; Methanococcus jannaschii; Themotoga maritima; 
Methanobacterium thermoautotrophicum; and Thermus thermophilis. 

The physiological function of dCTP deaminase has only been studied in £ 
CO//, where it plays an essential role in deoxyribonucteotide metabolism. dCTP 
deaminase converts dCTP to dUTP, which is an obligatory step in the de novo 
synthesis of thymidylate in bacteria (Taylor, A.F., and Weiss, B. J. Bacteriol. 
151:351-357 (1982)). In turn, uridine triphosphatase (dUTPase encoded by the dut 
gene), a ubiquitous enzyme found in bacteria, eukaryotes, and eukaryotic viruses, 
degrades dUTP to pyrophosphate and dUMP, the thymidylate synthetase substrate, 
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Thus. dCTP deaminase and dUTPase are functionally linked, with mutations in the 
dcd gene suppressing dut mutations (Wang. L. and Weiss. B. J. Baderiol. 174: 
5647-5653(1992)). 

dUTPase has shown to be an essential gene in £ coli and in yeast (El-Hajj. 
H.H., Zhang. H., and Weiss, B. J. Bacteriol. 170: 1069-1075 (1988); Gadsden, 
M.H.. et al. EMBO J. 12:4425^31 (1993)) because it functions in vivo to prevent 
dUTP incorporation into DNA. In £ coli dut mutants, the dUTP pool is greatly 
enhanced, resulting in an increased substitution of uracil for thymine in DNA during 
replication. Uracil-DNA glycosylase and exonuclease III play an essential role in 
repairing uracil-containing DNA in £ coli dut mutants (Taylor, A.F. and Weiss, B.. J. 
Bacteriol. 151:351-357(1982)). 

The substrate specificities, enzyme activities, and physiological role of dCTP 
deaminase and dUTPase had not been characterized in archea prior to this 
disclosure. 

Example 10 

Expression Cloning and Characterization of Recombinant P45 
1 . Expression Cloning of P45 by the Method of Ligation 
Independent Cloning (UC) 

Recombinant P45 was produced by PGR amplification of a portion of a P45 
genomic clone (done #9). The primers (oligos # 1 and 2 below) were designed to 
function with the Affinity Protein Expression and Purification System (Stratagene; La 
Jolla. OA), which uses Ligation Independent Cloning (LIC). 

Oligo # 1 . 5' eACGACGACAAGfilSCTACTTCCAGACTGGAAA 3' (SEQ ID NO: 82) 
Oligo # 2. 5" GGAACAAGACCC6TCCCACTTTCACAGATGAAGAG 3' (SEQ ID NO: 83) 

The bold letter segments represent sequences specific to the cloning vector 
while the adjoining sequence is specific to the clone #9 sequence. The ATG 
underiined in oligo #1 corresponds precisely to the 5' end of the P45 gene, while the 
sequence after the bold letters in oligo #2 corresponds to the end of the genomic 
insert. 

The PGR amplification was carried out in a volume of 100 pi containing: 1x 
Cloned Pfu Polymerase Buffer; 0.2 mM dNTPs (each); 200 ng of Primer oligo #1; 
200 ng of Primer oligo #2; 3 \i\ Genomic done #9 plaque core in 500 pi SM buffer 
(-2000 Lambda partides); 2.5 units Cloned Pfu DNA Polymerase; and 7 ng Native 
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PEF (where 10x Cloned Pfu Polymerase Buffer is: 100 mM KCI; 100 mM 
(NHJjSO,; 200 mM Tris-CI (pH 8.75); 20 mM MgSO,; 1% Triton® X-IOO; and 1000 
fjg/ml BSA). 

The thenmocycling parameters were: 95'C for 1 minute (1 cycle); 95X for 1 
minute - 56 X for 1 minute - 72*C for 5 minutes (30 cycles). 

The 2.5kb amplified product was purified and then subjected to limited 
nucleotide excision in the presence of dATP. This protocol promotes removal of 
nucleotides at the 3' temriini of the PCR product until an adenine residue is reached. 
The excision mixture (consisting of: 1x Cloned Pfu Polymerase Buffer 0.5 mM 
dATP; 43.5 pi PCR product (8.7 ng/pl); 1.25 units Cloned Pfu DNA polymerase) 
was incubated at 72°C for 1 0 minutes. 

20 pi of the exonuclease treated PCR product was annealed with 40 ng of 
digested pCAL-n-EK vector (exonuclease treated to produce ends complimentary to 
the sequence exposed in the PCR product) for 45 minutes at room temperature. 
The amount of insert molar excess, relative to vector, was approximately 9 fold. 
The pCAL-n-EK vector contains an upstream, in-frame calmodulin binding peptide 
tag, which allows the N-terminal fusion protein to be easily purified on calmodulin 
agarose (CAM agarose). Various other expression vectors, which may or may not 
produce fusion proteins, are known in the art and can be used to express P45 
protein or fragments thereof or to produce DNA constaicts with a sequence 
encoding P45 protein or fragments thereof. An expression vector need only contain 
DNA sequences operating to permit or control transcription from an appropriately 
linked nucleic acid. The type of control, the degree of transcription permitted, and 
the manor in which the vector and nucleic acid are appropriately linked may vary. 
Generally, an expression vector also contains a replication control sequence to 
allow the vector to replicate in a host. However, replication control sequences are 
not required where replication of the host is not crucial to expression. 

2. Purification of Recombinant P45 

Five microliters of the annealed vector/insert DNA was transfonmed into 
XL2-Blue competent cells. Ten of the resultant colonies were screened by PCR for 
the 2.5 kb insert and 9 w re found to contain the connect size insert. Plasmid DNA 
was prepared from three clones and then used to transform BL21(DE3) cells. Six 
BL21(DE3) colonies were grown for approximately 10 hours in 5 ml of 1x LB, 125 
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Mg/ml ampicillin at 37**C. These cultures were used to inoculate six flasks 
containing 250 ml 1x LB and 125 pg/ml ampicillin. When the optical density {ODjoo 
values) of the cultures reached 1.1 to 1.3, IPTG was added to a final concentration 
of about 1 mM. The cells were grown overnight at 37X. Cells were collected by 
centrifugation and the pellet estimated (visually) to be in the range 0.4 • 0.5 gm. 
The pellet was resuspended in 2.5 ml of calcium binding buffer 50 mM Tris-HCI 
(pH 8.0): 150 mM NaCI; 10 mM P-ME; 10 mM magnesium acetate; 2 mM CaClj. 

250 pl of 10 mg/ml lysozyme was added to the cells and the reaction was 
allowed to incubate on ice for one hour. The slightly lysed samples were sonicated 
twice with the Branson Sonifier 250. the microtip at a duty cycle of 50% and a 
setting of 4. The lysate was cleared by superspeed centrifugation. Cleared lysate 
was added to 700 pi of washed Stratagene Calmodulin agarose (50% CAM agarose 
by volume) and allowed to bind with gentle rocking at 4*C for 1 hour. The resin was 
washed 3 times with 10 ml of CaClj binding buffer and twice with 5 ml of the same 
solution except that the CaClj concentration was reduced to 0.1 mM. Washing was 
accomplished by centrifugation. removal of the supernatant, and resuspension in 
fresh buffer. 5 pi of the CAM resin was collected to examine on SDS-PAGE 
electrophoresis. The washed matrix was resuspended in 900 pi of elution buffer [50 
mM Tris-HCI (pH 8.0); 150 mM NaCI; 10 mM p-ME; 2 mM EGTA] and allowed to sit 
for one minute prior to pelleting of the agarose resin. The elution buffer containing 
P45 was removed to a separate tube and the CAM agarose was resuspended 
again in 900 pi of elution buffer. Next, a high salt elution buffer [50 mM Tris-HCI 
(pH 8.0); 1000 mM NaCI; 10 mM P-ME; 10 mM magnesium acetate; 2 mM EGTA] 
was used to elute remaining P45 protein from the CAM agarose: 

Subsequent SDS-PAGE analyses showed that the high salt elution buffer 
released a majority of the recombinant P. furiosus P45 protein from the column 
matrices. Some residual protein remained bound to the calmodulin agarose. 

The method described here to pnDduce P45 protein can be modified in 
numerous ways by methods known in the art. (Ausubel. P.M.. et al. (1989) Current 
Protocols in Molecular Biology, Greene Publishing Associates and Wiley- 
Interscience, New York, NY; Sambrook. J., et al. (1989) Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press. Cold Spring 
Harbor, N.Y.) In one possible modification, a P45 analog protein can be produced. 
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For example, a mutation can be introduced into the P45 coding region. Any type of 
mutation can be used including site-specific point mutation, deletion mutation, 
insertion mutation, and multiples or combinations of these mutations. This mutant 
coding region is inserted into an appropriate vector, which is transferred into a host 
cell. The host cell then expresses the P45 analog. A P45 analog protein 
substantially retains one or more of the PEF activity or dUTP or dCTP metabolic 
activities described herein. Thus, the fusion protein and affinity tag expression and 
purification system described here is only one of many ways to produce a 
recombinant PEF protein such as recombinant P45. 

Analogs may be constructed by, for example, making various substitutions of 
residues or sequences or deleting terminal or intemal residues or sequences not 
needed for biological activity. For example, cysteine residues can he deleted or 
replaced with other amino acids to prevent formation of unnecessary or incon^ct 
intramolecular disulfide bridges upon renaturation. Other approaches to 
mutagenesis involve modification of adjacent dibasic amino acid residues to 
enhance expression in yeast systems in which KEX2 protease activity is present 
Other mutations can be made that favor expression in various host cells. 

Typically, substitutions may be made conservatively. For example, one may 
substitute amino acids that have physiochemical characteristics resembling those of 
the residue to be replaced. Similarly, when a deletion or insertion strategy is 
adopted, the potential effect of the deletion or insertion on biological activity should 
be considered. In order to preserve the biological activity, deletions and 
substitutions will preferably result in homologous or conservatively substituted 
sequences, meaning that a given residue is replaced by a biologically similar 
residue. Examples of conservative substitutions include, but are not limited to, 
substitution of one aliphatic residue for another, such as He, Val, Leu, or Ala for one 
another, or substitutions of one polar residue for another, such as between Lys and 
Arg; Glu and Asp; or Gin and Asn. In view of this specification, one skilled in the art 
will be able to make other such conservative substitutions, for example, substitutions 
of entire regions having similar hydrophobicity characteristics 

Mutations in nucleotide sequences constructed for expression of an analog, 
in most instances, should pres rve the reading frame phase of the coding 
sequences and preferably will not create complementary r gions that could hybridize 
to produce secondary mRNA structures such as loops or hairpins which would 
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adversely affect translation of the receptor mRNA. Although a mutation site may be 
predetermined, it is not necessary that the nature of the mutation per se be 
predetermined. For example, in order to select for optimum characteristics of 
mutants at a given site, random mutagenesis may be conducted at the target codon 
and the expressed mutants or analogs screened for the desired activity. 

Not all mutations in the nucleotide sequence which encode the protein will be 
expressed in the final product. For example, nucleotide substitutions may be made 
to enhance expression, primarily to avoid secondary structure loops in the 
transcribed mRNA (see EPA 75,444A, incorporated herein by reference), or to 
provide codons that are more readily translated by the selected host, e.g., the well- 
known E. coli preference codons for E. coli expression. 

Mutations can be introduced at particular loci by synthesizing 
oligonucleotides containing a mutant sequence, flanked by restriction sites enabling 
ligation to fragments of the native sequence. Following ligation, the resulting 
reconstructed sequence encodes an analog having the desired amino acid insertion, 
substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures 
can be employed to provide an altered gene having particular codons altered 
according to the substitution, deletion, or insertion required. Exemplary methods of 
making the alterations set forth above are disclosed by Walder et al. (Gene 42:133, 
1986); Bauer et al. (Gene 37:73. 1985); Craik (BioTechniques. Jan. 12-19, 1985); 
Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press, 1981); 
and U.S. Pat. Nos. 4,518.584 and 4,737,462, which disclose suitable techniques, 
and are incorporated by reference herein. 

3. Comparison of Recombinant P45 and Native PEF PCR 
Enhancing Activity 

The 5.2kb "On/OfP* assay, described in example 1, was used to demonstrate 
that recombinant P45 (rP45) possesses PEF activity comparable to a natural PEF, 
The resulte are shown in Figure 14. When no PEF activity is added, the 5.2 kb 
amplification product is not generated as shown in the lane labeled "none" of Figure 
14. When proteins produced from the recombinant clones of P45 (called "1" or "23" 
from their plasmid delineation) were added to the On/Off assay in amounts of 5Mi, 1 
pi. or 1 Ml of a 1/10 dilution (approximately 100. 20 and 2 ng, respectively), they all 
showed enhancement of PCR product yield comparable to the two native PEF 
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preps. Interestingly. 5 pi worked less well than 1pl. An "overdose" phenomenon 
was also noted with native PEF (example 16). 

4. Molecular Weight of Recombinant P45 Compared to Native PEF 
Complex and Protein Components of the Complex 

The predicted molecular weight of the translated rP45 DNA sequence was 
18.6 kDa. However, the native P45 component migrates as pari of the PEF 
complex at 300 kDa without heat denaturation, and at 45 kDa with heat 
denaturation (Sg-C for 5 minutes, partially denatured). Protein complexes in 
hypenhermophiles are unusually stable and sometimes require extreme conditions 
before disassociation occurs. We have found that P45 migrates at approximately 
18kD, or approximately 17-18kD. instead of 45kD when the native PEF complex is 
boiled in the presence of 2% SDS and 1% trichloroacetic acid (TCA) (fully 
denaturing conditions)(Figure 15). Without heat denaturation. more than half of the 
recombinant P45 migrated at approximately 50 kDa (45 + 4 kDa calmodulin binding 
tag), while the remaining sample migrated close to 26 kDa (Figure 15). With heat 
denaturation all of the sample migrated at approximately 26 kDa, suggesting that 
the recombinant is a dimer in its undenatured state. Trichloroacetic acid heat 
treatment failed to produce any additional bands in the recombinant sample. 

The migration differences between the fully denatured forms of the native 
(18kD) and recombinant P45 (26kD) might be explained by differences in post- 
translational modifications between P. furiosus and the £ co// host or by the effects 
of the 4kD GBP (calmodulin binding protein) tag on the folding or migration of the 
rP45 fusion protein. 

Example 11 

Potential Mechanisms of PCR Enhancement by P45 
1 . Identification of the Reaction Catalyzed by PEF/P45 

The similarity of P45 to dCTP deaminases prompted us to test whether 
dCTP or other nucleotide triphosphates were substrates of PEF/P45. PEF was 
incubated with dCTP or dUTP and the reaction products were separated by reverse 
phase HPLC. Reaction mixtures (50^1) containing 1x cloned Pfu polymerase buffer, 
10mM dCTP or dUTP, and 700ng PEF (or 5;^! of cloned p45 (-100 ng) or nothing 
(neg. control)) were incubated at 72''C for 1 hour. The samples were injected on a 
3.9 X 250 cm Waters Delta-pak C-1B column (300 angstrom/1 5;im). equilibrated in 
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50mM triethylammonium acetate. pH(7.0) (A). Products were eluted with 
acetonitrile (B) at 2 ml/min. using the following gradient: 0% B for 5 minutes. 0-10% 
B over 20 minutes. Absorbence of the eluate was monitored with a photodiode 
array detector, and peal( areas were integrated at 260nm. 

The data in Figure 16 show that both the native PEF complex and purified 
recombinant p45 utilize dUTP as a substrate. In the presence of cloned Pfu DNA 
polymerase PGR buffer. dUTP migrates at 10.517 minutes. After a one hour 
incubation with PEF or recombinant P45 (purified from clone 1 or 23). the dUTP 
peak disappeared (0.1% total peak area) and a new peak appeared eluttng at 
4.400 minutes (98% total peak area). The product of the PEF + dUTP reaction 
migrated with the same retention time as a dUMP standard, which was different 
from the retention times of dUDP. dCMP. dCDP, and dCTP. To confirm that the 
product was dUMP. a dUMP standard and the PEF + dUTP product were mixed 
together and re-injected. The mixed sample produced only one peak. 

In addition to dUTP. dCTP was also found to serve as a substrate for PEF 
(Figure 17). The product of the PEF + dCTP reaction appeared to be dCMP. based 
upon its retention time and absorbence maxima. Although PEF utilizes dCTP. 
dUTP is the preferred substrate of PEF/P45. In the example shown, only 16.6% of 
the dCTP was converted in 1 hour under the same conditions which converted 
100% dUTP (Figure 17. panel B). Moreover, when dCTP and dUTP were 
incubated together with PEF. only dUMP (4.2 minute peak in panel C) was 
generated. No reactivity with PEF was detected for the following nucleotides: 
dGTP. dATP, dTTP, dCMP, dUMP. and rUTP under the conditions used. 

The substrate preference and reaction catalyzed by PEF/P45 was 
significantly different from that predicted based upon the amino acid similarity to 
dCTP deaminases. Although both enzymes bind dCTP and dUTP, the fonward 
reaction catalyzed by dCTP deaminase is the deamination of dCTP to produce 
dUTP + NHj. PEF/P45. on the other hand, preferentially utilizes dUTP, and 
catalyzes the release of pyrophosphate (PPi). No deamination of dCTP by P. 
furiosus PEF or P45 was observed under the conditions used. 
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2. Enzymatic Detection of Inorganic Pyrophosphate (PP,) 
Produced by PEF from dUTP 

We tested whether inorganic pyrophosphate (PP,) or inorganic phosphate 
(P.) was generated during the PEF-catalyzed conversion of dUTP to dUMP (Figure 
16). To assess whether P, was produced. 900 or 1800 ng of PEF was incubated in 
the presence of 5 mM dUTP in 1x cloned Pfu polymerase buffer for 1 hour at 72'C. 
The reaction mixture was subsequently analyzed for the presence of P, using the 
method of Heinonen and Lahti (Heinonen. J.K. and Lahti. R.J. (1981) Anal. 
Biochem. 1 13: 313-317). Previous results (HPLC) demonstrated that under the 
above reaction conditions. 100% of the dUTP should be converted to product. 
One hundred percent (100%) conversion coaesponds to the production of 500 
nmol of P„ if P, is a product of the reaction. However, no P, was detected In these 
assays. The assay for P, was sensitive enough to detect as little as about 50 
nmoles of P, These observations show that P, is not fomied during the reaction of 
PEF with dUTP. 

In experiments to investigate whether PP, is fomied during the reaction, 900 
or 1800 ng of PEF was incubated with 10 mM dUTP in 1x cloned Pfu polymerase 
buffer for 1 hour at 72'C. The presence of PP, in the reaction mixture was then 
quantified using Sigma's "Enzymatic Determination of Pyrophosphate" l<it (Sigma 
Product No. P7275). The Itit utilizes a coupled enzyme system whereby two moles 
of NADH are oxidized to NAD' for each mole of PP, present in the reaction mixture. 
The oxidation of NADH is monitored spectrophotometrically at 340 nm. Using this 
assay system, the production of PP, was cleariy established in reactions that 
contained PEF and dUTP. No PP, was detected in control reactions that ladced 
PEF or that contained PEF and dATP (in place of dUTP). The reactions that 
contained 1800 ng of PEF produced twice as much PP, as those that contained 
900 ngofPEF. 

The temperature optimum (T^) for the generation of PPi from dUTP was 
measured by incubating mixtures of dUTP and native PEF at temperatures ranging 
from 73''C to 99'C. PPi production by native PEF increased steadily over this 
temperature range and was highest at 99°C. While the T,,, for native PEF was 
found to >99*C, recombinant P45 exhibited maximal activity between 85°C and 
93'C when tested at the same nzyme concentration (dUTP tum-over), but at a 
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lower total protein concentration. Accordingly, specific temperature ranges, such as 
from about 70*C to about 100X, from about 85X to about or temperatures 
above about 70**C. can be used with a PEF or polymerase enhancing activity of the 
invention. 

The enzyme activity of three different preparations of S200-purified native 
PEF was measured at 85*'C. Protein concentrations were detennined by both 
Bradford and by amino acid analysis. Shown below is a summary of the enzyme 
activity and specific activity (protein concentration determined by Bradford or AAA 
as indicated) of S200-purified native PEF, These activites were compared to the 
minimum amount of purified PEF required to amplify the 5.2kb target in the "on/ofT 
assay described in example 1 (100p! PGR). 
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The data indicate that for purified PEF preps 1 and 2, there is an excellent 
agreement between dUTPase activity and PCR enhancing activity. However, native 
PEF prep 3 exhibited 2-4 times less PCR enhancing activity, possibly due to its 
lower specific activity. Prep 3 may contain contaminants, which interfere with PCR 
enhancement. 

PPi formation from dCTP was also measured by substituting dCTP for 
dUTP. For native PEF prep 1 , the level of dCTPase activity was found to 
0.097(Mmole PPi/hr/pl) at 85"C, which is 12-fold lower than the rate of PPi 
production from dUTP. In addition, recombinant P45 preparations were also tested 
and found to produce PPi from both dUTP and dCTP. Native PEF and recombinant 
P45 exhibited a similar degree of preference for dUTP, as compared to dCTP. 
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Therefore, recombinant P45 and structurally similar P45 proteins catalyze 
this reaction in the absence of any of the other components of the PEF complex. 
The methods described here for the PP, detection are, thus, a useful tool for 
analyzing the activity of or determining the presence of native PEF, PEF analog 
proteins, recombinant or synthetic PEF proteins, PEF complexes, and rP45. 

3. PEF/dUTP Product Characterization with Mass Spectrometry 
Electrospray mass spectral analysis was performed to characterize the 

byproduct of reactions employing PEF and dUTP. Analysis in the negative 
ionization mode produced a relatively clean spectrum exhibiting peaks at 307 and 
615 m/e. These masses are consistent with those of dUMP [M-H]- and its non- 
covalent dimer [2M-H]*. Analysis in the positive mode gave a complex an-ay of 
peaks. 

4. Role of PEF/P45 in Eliminating dUTP Accumulation During PCR 
An understanding of the catalytic activity of PEF/P45 has provided insight 

into the mechanism by which PCR enhancement occurs, Lasken et at, have 
reported that archeal DNA polymerases, such as Vent, incorporate dUTP at 
approximately 40% the rate of TTP (Roger S. Lasken, David M. Schuster, and 
Ayoub Rashtchian, (1996) J. Biol. Chem. 271; 17692-17696). However, further 
DNA synthesis by archeal DNA polymerases appears to be inhibited by dU- 
containing DNA. Inhibition appears related to the 6500-fold greater affinity of Vent 
for dU-containing sites, as compared to dT-containing DNA. Based upon these 
observations. Lasken has proposed that archeal DNA polymerases may play a role 
in repairing dU-containing DNA in vivo. 

One possible mechanism for the PCR enhancing activity of PEF/P45 is that 
its associated dUTPase activity may convert any dUTP present during PCR to 
dUMP, a side-product which should not interfere with DNA polymerase activity. By 
so doing. dUTP would not be available for incorporation into the PCR product, and 
hence dU-DNA inhibition of the DNA polymerase would not occur. Such a 
mechanism is consistent with the increased PCR product yields generated by 
archeal DNA polymerase in the presence of PEF/P45. 

dllTP. however, is not intentionally added to PCR reactions but may be 
generated by spontaneous deamination of dCTP during the high temperatures used 
in PCR cycling (Jens-Peter Horst and Hans-Joachim Fritz, (1996) The EMBO 
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Joumal 15; 5459-5469). To assess the extent of dCTP deamination to dUTP during 
PCR. we incubated dCTP (lOmM) at 95»C (PCR denaturation temperature) in the 
presence of cloned Pfu DNA polymerase PCR buffer. The products of heat 
treatment were analyzed by reverse phase HPLC (as described above) and peak 
assignments were made by comparing retention times to those of standards. 
Figure 18 shows the results obtained when dCTP is heated for 1 hour (panel B) or 4 
hours (panel C) at 95°C. dCTP breaks down into 2 predominate species. dCDP 
(22%/1 hr; 41%/4hr.) and dCMP (2.8%/1 hr.; 25%/4 hr.). A minor peak at 12.38 
minutes was also produced (.049%/1 hr.; .1 16%/4 hr.). which was identified as 
dUTP, based upon its retention time (dUTP standard = 12.10 minutes; panel D) and 
absorbence maximum. As shown in Figure 19. the 12.38 minute peak generated by 
heating dCTP (spectrum 2) exhibits maximal absorbence at approximately 260nm. 
as does the dUTP standard (spectrum 3). while dCTP absorbs maximally at 
approximately 270nm (spectrum 1). 

dUTP production was also measured during PCR cycling. A dCTP solution 
(lOmM dCTP in cloned Pfu DNA polymerase PCR buffer) was subject to PCR 
cycling in a RoboCycler40 using the cycling conditions described for the 5.2kb 
'on/or system (example 1). Products generated during PCR were analyzed by 
HPLC as described above. After 30 cycles, the following products were evident: 
79% dCTP. 19% dCDP. 1.8% dCMP, and .064% dUTP (Figure 20. panel A). The 
dUTP generated during PCR cycling of dCTP exhibited a retention time (12.1 min.) 
and absorbence maxima (263nm) which were very close to those of the dUTP 
standard (11.9 min.; 263nm) (Figure 20, panel D). When dCTP was cycled in the 
presence of Pfu DNA polymerase (SOX PCR conditions. lOmM dCTP and 1,25U 
Pfu/^l reaction mix) there was no difference in the amount of dCDP. dCMP. or 
dUTP produced (Figure 20. panel C). However, when dCTP was cycled in the 
presence of PEF (50X PCR conditions, lOmM dCTP and 0.5ng PEF//il reaction 
mix), the following products were produced: 73% dCTP. 19% dCDP. and 8% dCMP 
(Figure 20. panel B). There was no detectable dUTP generated when dCTP was 
cycled in the presence of PEF. consistent with an associated dUTPase activity of 
PEF. 

The increase in dCMP production in dCTP + PEF samples (8%) as 
compared to dCTP + Pfu samples (1.7-1.8%) shows that in addition to eliminating 
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the minor dUTP deamination product during PGR, PEF will also convert dCTP to 
dCMP. In this experiment (50x PGR conditions), the final dGTP concentration post- 
PCR was 73% in PEF-containing reactions and 79% in those lacking PEF. This 
slight drop in the dCTP pool is not anticipated to affect PGR product yield or DNA 
polymerase replication fidelity significantly. However, it is anticipated that the use of 
higher amounts of PEF in PGR (»1ng per 100^1 reaction) will be deleterious due to 
dCTP reactivity. If high amounts of PEF are used, it is possible that the dGTP pool 
could fall below levels required for maximal yields and lowest misinsertion rates. As 
described elsewhere, we have observed inhibition of replication or amplification 
and/or smearing of products with the use of excessive amounts of PEF. 

5. PEF Reverses Inhibitory Action of dUTP in Amplification 
Reactions 

Although Lasken reported that the incorporation of dUTP in the nascent 
DNA strand only inhibited archael polymerases by 40% (Lasken, et al. (1996) J. 
Biol. Ghem, 271; 17692-17696), we unexpectedly found that the presence or 
addition of small amounts of dUTP into PGR reactions had more dramatic inhibitory 
consequences, as demonstrated below. 

A relatively small (0.9kb) fragment of the human a 1 -antitrypsin gene was 
amplified in the absence or presence of dUTP. PGR reaction mixtures contained 
the following (in a 100 pi volume): 1x Gtoned Pfu polymerase buffer; 200 pM each. 
dGTP, dGTP, dATP; 200 ng oligo F91-23 (100 ng/pl); 200 ng oligo R980-23 (100 
ng/Ml); 125 ng Human genomic DNA; 2.5 units Pfu DNA polymerase; 200 pM total 
of (dTTP and dUTP) or (dTTP + PEF generated dUMP). 

F91-23 5' GAGGAGAGCAGGAAAGGTGGAAC 3' (SEQ ID NO: 84) 

R980-23 5' CTCCATGTCCCAACTCCGATCAC 3' (SEQ ID NO: 85) 

PEF generated dUMP was prepared as described in Example 11, section 1, and 
purified by reverse phase HPLG. 

PGR cycling was earned out as follows: 95X for 1 minute (1 cycle); 95^0 
for 1 minute - 58*G for 1 minute - 72X for 2 minutes (30 cycles). 

The PGR products were examined on a 1% agarose. 1x TBE gel as shown 
in Figure 21. The amplification of the 900 bp product from human genomic DNA 
was completely inhibited with dUTP concentrations as low as 2mM (1% of the 
dTTP+dUTP pool) and partially inhibited at 0.2mM (0.1%). The concentration of 
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individual nucleotides in a standard PCR reaction is about 200mM. dUTP inhibition 
can be completely prevented when PEF or the recombinant P45 protein (rP45) is 
added to PCR reactions containing dUTP. Seven ng of PEF can reverse the 
inhibition caused by as much as 20 pM dUTP (data not shown). The preferred 
amount of PEF or P45 used in a particular reaction can be optimized according to 
the principles provided here or by methods for quantifying amplification reactions 
known in the art. 

Unlike dUTP. the PEF generated byproduct, dUMP. was not inhibitory in Pfu 
polymerase-based PCR reactions, even when present at concentrations of 20 pM. 
In Figure 21 , amplifications containing purified dUMP appear less robust than other 
bands on the gel. It should be noted that the other PCR products on this gel were 
generated by Pfu in the presence of PEF or rP45. which has been demonstrated to 
enhance PCR product yields as compared to amplifications conducted in the 
absence of PEF/rP45. These results are consistent with the enhancing activity of 
PEF and rP45 being a result of dUTPase activity. The dUTPase activity may 
hydrolyze dUTP and thereby prevent the incorporation of dUTP into DNA. As 
demonstrated in Figure 21. dUTP incorporation by Pfu DNA polymerase during 
PCR can significantly decrease PCR product yields. Accordingly, the invention 
comprises a method of enhancing nucleic acid replication or amplification reactions 
by reducing the dUTP concentration or preventing the incorporation of dUTP into 
replicated or amplified products, as well as compositions that are capable of 
preventing that incorporation. 

We also tested whether PEF/rP45 could reverse the inhibition caused by 
uracil-containing DNA. PCR amplification was carried out in the presence of a third 
unrelated primer, which contains 9 dUs instead of dTs (dU oligo). Primers 
complementary to M13 DNA were synthesized. 

Control Oligo 5'GGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGT 3'{SEQ ID NO: 86) 
dU Oligo 5'GGUUDUCCCAGUCACGACGUUGUAAAACGACGGCCAGU 3' (SEQ ID NO:87) 

The 900 bp al-antitrypsin fragment was amplified in the presence of the 
oligos, added at levels ranging from 200ng (16 pmole) to 0.2 ng (0.016 pmole) per 
lOOul reaction. In addition, similar reactions were performed with Taq DNA 
polymerase instead of Pfy DNA polymerase. In Figure 22, the control 
oligonucleotide shows inhibition of the Pfu-based PCR reaction at high 
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concentrations (16 and 8 pmole). In contrast, inhibition by the dU oiigo was readily 
observed at concentrations as low as 0.8 pmole. It seems likely that both 
oligonucleotides sequester Pfu DNA polymerase away from the PGR primers and 
genomic DNA, but the dU oligo can inhibit at lower concentrations. These results 
show that Pfu DNA polymerase may be more tightly bound to a dU containing 
substrate. With the addition of PEF, a small amount of amplification product is 
observed in reactions containing 0.8 pmole of dU containing primer 

The enhancement by PEF in the dU oligo-inhibited reaction could be 
achieved through at least two possible pathways. The most likely explanation is 
that PEF is having no effect on the dU containing oligonucleotides and is simply 
increasing the activity of Pfu DNA polymerase by scavenging dUTP generated 
during PGR by heat- or chemically-induced deamination of dGTP (eg., lanes 2 and 
3 of Figure 22). Or. PEF may work with Pfu DNA polymerase to remove the uracil 
moieties from the oligonucleotides through a repair pathway. Thus. PEF may also 
be used in repair reactions employing appropriate polymerase activities. Repair 
reactions are known in the art and methods to adopt the use of PEF into those 
reactions can be devised by those skilled in the art. 

The knowledge of potential PEF mechanisms of action described here 
allows those skilled in the art to employ other dUTP modifying enzymes in 
enhancing polymerase reactions. A definition or one of these other modifying 
enzymes can be an enzyme that diminishes the capacity to incorporate dUTP by 
polymerases or at least partially inhibits dUTP incorporation. Assays used to 
identify and characterize PEF as described herein can also show other dUTP 
modifying enzymes. These other modifying enzymes could also mimic the 
enhancing attributes of PEF or a particular protein, such as P45 or rP45. An 
example of this class of enzyme would be dUTP pyrophosphatases (EG 3.6.1.23), 
such as deoxyuridine 5'-triphosphate nucleotide hydrolase, as well as other 
enzymes involved in dUTP metabolism, catabolism, or synthesis. These other 
enzymes may be used alone or in combination with PEF or other proteins or 
enhancing additives. 

Furthemiore, the presence of the consensus uridine-binding motif or the 
related sequences shown herein can also be used to define an enzyme or protein 
that is a PEF. Thus, a protein the comprises SEQ ID NO.: 72, or any one of SEC 
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ID NOs.: 72-81, or combinations of these sequences, may be a PEF according to 
this invention. 

Example 12 

Identification of Other Proteins for Enhancing Polymerase Activity 
The stoictural infomnation, in the amino acid and nucleotide sequences, as 
well as the functional infomfiation described here allow one skilled in the art to 
identify polymerase enhancing and/or dUTPase activities from a variety of sources. 
For example, we have shown above how degenerate probes made from the amino 
add sequences of P50 and P45 can be used to clone nucleotide sequences 
encoding polymerase enhancing and dUTPase activities, or PEF. Since we have 
identified the importance of dUTPase activity in controlling and enhancing 
polymerase reactions, such as PGR. structural information available for any 
dUTPase can be put to a new and advantageous use in identifying and producing 
proteins for enhancing polymerization reactions. Furthermore, the assays 
described can be used to identify the presence of dUTPase activity from any 
source. 

1 . Cloning Human dUTPase as a Representative Eulcaryotic 
Protein for Enhancing Polymerization Reactions 

To determine if other enzymes with dUTPase activity could also produce 
polymerase enhancing activity, we cloned a representative eukaryotic protein, 
human dUTPase. Total RNA was isolated from human placenta and converted to 
cDNA as follows: 5 pi total human RNA. 5 pi oligo dT (0.1 pg/pl), 1 pi Moloney 
murine leukemia virus reverse transcriptase (40 u/pi), 1 pi 100 mM dNTPs, 5 pi lOx 
first strand buffer, 33 pi DEPC-treated water (where 1x first strand buffer is 50 mM 
Tris-HCL (pH 8.3). 75 mM KCI. 10 mM DTT, and 3 mM MgCy. The reaction was 
incubated at 37*C for one hour. A negative control was run in parallel without 
reverse transcriptase. 

Primers containing a sequence specific to the 5' and 3' termini of one of the 
human dUTPase genes were synthesized and are shown below. The accession 
numbers for the cDNA sequence of Human deoxyuridine triphosphatase (DUT) are 
gi|1421817|gb|U62891|HSU62891. These primers also shared sequence with the 
vector pCAL-n-EK (in bold print below), which allowed ligation independent cloning 
(LIC) of the amplified product, as described in Example 10. 
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Primer 285 gacgacgacaagatgccctgctctgaagagacacc (SEQ ID NO • 88) 
Pnmer 286 ggaacaagacccgtttaattctttccagtggaacc (SEQ ID NO.: 89) 

Prior to PGR, the reverse transcriptase was heat inactivated by incubating the 
reaction at 80X for 5 minutes. The dUTPase sequence was amplified in a 100 pi 
reaction containing 1x cloned Pfu polymerase buffer. 200ng of each primer. 200 pM 
dNTPs, 2.5 units of Pfu DNA polymerase. 3 ng of PEF complex and 3yl of human 
placenta cDNA from the previous section. 

The reactions were amplified under the following conditions: 95 X for 3 
minutes (1 cycle); 95X for 1 minute - 50»C for 1 minute - 72T for 2 minutes 
(30 cycles). The amplified reaction was examined on a 1% agarose gel to confinn 
that the product exhibited the correct size before purification. The purified product 
was cloned into the expression vector pCAL-n-EK. as described in Example 10. and 
transformed into XLI-Blue cells. Three clones were confimied to contain human 
dUTPase by sequencing of the first 500 bases. After the transformants were shown 
to contain the dUTPase sequence by PGR amplification, their plasmids were 
harvested and used to transfomi the £ co// strain BL21 (DE3). 

2. Human dUTPase Expression and Activity Analysis 
The BL21/dUTPase clones were induced with IPTG and the expressed 
protein was purified by means of the calmodulin binding peptide (GBP) tag 
expressed as a fusion protein at the amino terminus of the dUTPase sequence. 
The fusion protein was purified on calmodulin agarose, as described in example 10. 
The protein products were analyzed by SDS-PAGE and found to be of the correct 
molecular weight. 

To confirm that the dUTPase clones were active, the Sigma 
pyrophosphatate assay (see Example 1 1 ) was utilized. The assay demonstrated 
that all of the clones tested could convert dUTP to dUMP + pyrophosphate. The 
human dUTPase enzyme was thermolabile and became completely inactive after a 
one minute pre-incubation at 70'*G. 

Polymerase enhancement was also detected with the 5.2 kb on/off assay. 
The assay was modified from the protocol described in Example 1 to allow 
detection of the themiolabile PEF activity. A PGR cocktail was mixed to provide an 
identical starting point for all samples. Ninety-nine microliters of th cocktail was 
aliquoted into six thin-walled, 0.5 ml tubes. The reactions contained 278 ng of 
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human genomic DNA. 200 ng of each primer (see Example 1 ). 200 pM each dNTP. 
2.5 units of Pfu DNA polymerase in 1x cloned Pfu polymerase buffer. At each 60'C 
annealing step. 0.5 pi of one the following were added: human dUTPase 
preparation, a 1/1 0th dilution of the human dUTPase preparation. 2 ng/pl rP45 
(positive control), or dUTPase storage buffer (negative control). Both human 
dUTPase reactions were mn in duplicate. The samples were cycled as follows: 
QS'C for 1 minute (1 cyde); 95»C for 1 minute - 60'C for 1 minute - 72-0 for 5.2 
minutes (30 cycles). 

10 pi of each PCR reaction was visualized on a 1% agarose. 1x TBE gel by 
ethidium bromide staining. (See Figure 23) The undiluted human dUTPase 
preparation was able to enhance the polymerase activity to produce the 5.2 Kb 
band. As demonstrated by the negative control, lack of any enhancing factor 
results in a failed PCR. The positive control for this experiment. rP45. did not work 
in this experiment, presumably because an excessive quantity (30 ng total) was 
added. 

Example 13 

Production of Antibodies to PEF and Western Blot Analysis 
1 . Production of anti-PEF and anti-rP45 IgG 

PEF-specific IgG was purified by immunoaffinity chromatography from the 
sera of rabbits previously immunized against a lot of native Pfu DNA polymerase 
containing PEF (see Figure 24 showing gel of purified native Pfu polymerase 
preps). The S-200-purified Pfu PEF was covalently coupled to AffiGel 10 (BioRad: 
Hercules. CA) in the presence of 20mM HEPES. 1mM DTT, 50mM KCI. 0.05% 
Tween 20. 1 mfi^ EDTA. and 10% glycerol, following the manufacturer's 
recommended protocol. Rabbit sera (2.4 ml) was loaded onto a 0.2 ml column in 
the presence of lOmM Tris (pH 7.5). The column was washed extensively and the 
specific IgG was eluded with 0.1M glycine-HCI (pH 2.5) followed by 0.1M 
triethylamine (pH 11.5). Using a Centricon-30, the IgG was concentrated and the 
elution buffer replaced with PBS. 

In addition, sera containing rP45-specific IgG was obtained by immunizing 
rabbits with recombinant P45. which was prepared as a tagged fusion protein, as 
described in example 10, section 2. The purified enzyme (0.177 mg/ml) was used 
to immunize two N w Zealand white rabbits using the following immunization 
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schedule: 90 ug/rabbit in Complete Freund's Adjuvant (CFA); 18 days later, boost 
with 45 Mg/rabbit in incomplete Freund's adjuvant (IFA); 39 days later, second 
boost; 45 days later, obtained serum sample for Western blot. 

2. Western Blot Analysis Using anti-PEF Antibodies. 

Cell extracts were prepared by suspending cells in 4X SOmlVI Tris, pH 8.2, 
lOmM BME. ImM EDTA, and 10% glycerol, followed by sonication, Then. 2.5mM 
PMSF was added and the cellular debris removed by centrifugation for 15 minutes 
at 14.000 rpm. PEI was added to the supernatant to a final concentration of 0.9% 
and the mixture centrifuged again. The supematants (10 pi) were electrophoresed 
on 4-20% SDS-PAGE gels and the proteins transferred to nitrocellulose by 
electroblotting. The blots were bloclced with 1% Blotto/PBS for 1 hour at room 
temperature and then incubated with PEF-specific IgG ovemight at 4'C. The blots 
were washed in PBS-.05% Tween 20. and then incubated with all^aiine 
phosphatase-conjugated goat anti-rabbit IgG. The blot was washed and then 
incubated in color development solution (100 mM Tris-HCI. pH 9.5, 100 mM NaCI. 5 
mM MgClj. 0.3mg/ml NBT. and 0.15 mg/ml BCIP) for approximately 1-2 minutes. 
The enzyme reaction was stopped and the membrane was washed five times with 
deionized water. 

Figure 25 depicts the results of the Western Blot. Extracts were prepared 
from 5 different fennentations of P. furiosus (lanes 1-5). In addition, three extracts 
or partially purified column fractions from 7. aquaticus (lanes 6-8) and one extract 
from £ CO// (lane 9) were also run. Purified PEF (550 ng) and pre-stained 
molecular weight mariners were run in lanes 10 and 1 1 . respectively. With the 
exception of the mariners, all samples were boiled in SDS/BME dye prior to loading. 
The results show PEF-specific IgG binds to and cross-reacts with components of 
the PEF complex in crude Pfu extracts, including the P50 and P45 components. In 
contrast, no cross-reaction was observed vwth extracts from T. aquaticus or £ coli. 

3. Western Blot Detection of Native PEF and Recombinant P45 
Using anti-rP45 Sera 

Native PEF samples were electrophoresed on a 4-20% gradient Tris-Glycine 
SDS gel. The samples were loaded without denaturation (P300 forni) or after 
partial (boiling in 2%SDS; P45 form) or complete (boiling 2%SDS plus 1%TCA) 
denaturation. The samples were transfen-ed to nitrocellulose and the blots 
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d veloped as described above, except that sera from rabbits immunized with 
recombinant P45 was used (diluted . 1:000). 

in Figure 26. antibodies specific to recombinant P45 crossreact with bands 
in undenatured, partially denatured, and fully denatured samples of native PEF. 
The predominant bands are the P300 aggregate (P45/P50) present in the unhealed 
lane, the 45kD (panially denatured) form present in the boiled/SDS sample, and the 
17kD (fully denatured) form present in the boiled/1% TCA/2% SOS lane, which 
presumably represents the fully denatured monomer. In addition to these different 
aggregation states of native P45. there are also minor bands present in the 
Westem blot, which may represent additional fomis of P45 or the cross-reaction of 
antibodies to Pyrococcus proteins sharing common epitopes with £ co// 
contaminants present in the recombinant P45 preps. 

As with the PEF-specific IgG from above. anti-rP45 sera can also be used to 
identify immunochemically-related proteins from other species. In Figure 27, the 
rP45-specific IgG was used to probe a blot containing cellular extracts from bacteria 
(Bac///us species. T. aquaticus), archea (P. furiosus. M. themoautotmphicum, and 
an uncharacterized species) and human cells (HL60 lymphoma). The results 
obtained show the antibodies bind to proteins from P. furiosus, the uncharacterized 
species, and the human cell line, which migrate with the same apparent molecular 
weight as purified native PEF. The antibodies also bind weakly with proteins in the 
fiac///us and M. thermoautotropicum extracts, although these bands migrate with a 
different apparent molecular weight than P. furiosus PEF species. As with the PEF- 
specific IgG. no cnjss-reactivity was detected with T. aquaticus extracts. 

In a separate Westem assay, samples from Thermus thermophilis were run 
out on an SDS-PAGE gel and probed with the rP45 antisera. The rP45 antibody 
binds to a band of approximately 24kD (between mari<ers of 16kD and 30kD) and 
another band approximately twice that size, presumable a dimeric or multimeric 
form. This demonstrates that PEF activity or protein is present in the T. 
thermophilis samples. The PEF as dUTPase activity in these samples was further 
confimied by a dUTP conversion assay employing reverse phase HPLC. The 
assay detected the tum-over of dUTP with the 7. thermophilis samples, but a 
control sample showed no detectable tum-over of the dUTP present. 
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Example 14 

Use of PEF Complex in Nucleic Acid Replication Reactions 

Initially and as a control to confirm the activity of the DNA polymerase used, 
gapped-duplex calf thymus DNA (Pharmacia) assays were performed. The 
polymerase cocldail contained 50mM Tris-HCI. pH 8.0. SmiVI MgCI,. ^mM DTT. 
50/ig/ml BSA. 4% glycerol. 200^M each dNTP. ('HITTP (.5mCi/;.mole final 
concentration), and 250A«g/ml of activated calf thymus DNA (Phannacia). Samples 
containing Pfu DNA polymerase or P. furiosus PEF were serially diluted in Pfu DNA 
polymerase storage buffer (50mM Tris-HCI, pH 8.2, 0.1% NP-40, 0.1% Tween-20, 
O.ImM EDTA, ImM DTT, 50% glycerol) and then U\ of each dilution was added to 
IG^il aliquots of polymerase cocktail. Polymerization reactions were conducted in 
triplicate for 30 minutes at 72'C. The extension reactions were quenched on ice, 
and then 5iu\ aliquots were spotted immediately onto DE81 filters (Whatman). 
Unincorporated [»H]TTP was removed by 6 washes with 2xSCC (0.3M NaCI. 30mM 
sodium citrate, pH 7.0), followed by one wash with 100% ethanol. Incorporated 
radioactivity was measured by scintillation counting. The assay was calibrated by 
counting a known amount of pHTTTP on DE-81 filters, omitting the wash steps. 
One unit of polymerase activity is defined as the amount of enzyme which catalyzes 
the incorporation of 10 nmoles of total dNTP into polymeric fomi (binds to DE-81 
paper) in 30 minutes at 72"C. Polymerase concentrations (U/ml) were extrapolated 
from the slope of the linear portion of units vs. enzyme volume plots. 

The PEF samples tested exhibit no significant DNA polymerase activity while 
the Pfu DNA polymerase exhibited a specific activity of 2-4 x 10" u/mg. 

1. Enhancement of Cloned Pfu DNA Polymerase with Pfu PEF 
P. furiosus PEF has been demonstrated to enhance the yield of PGR 
products generated with recombinant Pfu DNA polymerase using plasmid, lambda, 
and genomic DNA templates (Figures 28-30). The results demonstrate that the 
addition of P. furiosus PEF increases PGR product yield for a variety of PGR 
systems, ranging in target complexity. Relatively easy targets, e.g. plasmid DNA, 
can be successfully amplified with Pfu DNA polymerase, and the addition of PEF 
further increases product yield (Figure 28). We have found that fewer PGR cycles 
or lower template concentrations can be used in PEF-containing reactions. 
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demonstrating the advantageous properties of the proteins and compositions 
described and the methods employing them. 

The most dramatic enhancements were observed when long (Figure 29) 
and/or highly complex targets (Figure 30) are amplified. The presence of PEF 
unexpectedly and significantly improves the amplification of these targets, which 
are often poorly amplified by single enzyme PCR reactions. In Figure 29. the 
addition of 1-100 ng of PEF {S200-purified PEF; prep. 3) to lOOpI PCR reactions 
containing SU of Pfu DNA polymerase significantly increased yields of a lOkb PCR 
product. In Figure 30. a 5.2kb target was successfully amplified from human 
genomic DNA in the presence of .3-280ng PEF (SCS #52 S200 purified) per lOOpI 
PCR, but not in the absence of PEF, despite the use of 1.9 min. per kb extension 
times. 

P. furiosus PEF has been found to enhance the yields of both specific and 
non-specific PCR products, when amplifications are conducted under less stringent 
PCR conditions with PCR primers that hybridize to more than one target. £ coli 
ssb (single-stranded binding protein; Stratagene's PerfectMatch) has been shown 
previously to increase the specificity of primer extension reactions, presumably by 
minimizing the fonnation of pooriy matched primer-template complexes. When 
used in conjunction with £ co// ssb, P. furiosus PEF has been found to enhance the 
yield of specific PCR products (Figure 31). Pfu PEF also enhances yields of PCR 
products obtained with exo-Pfu and a mixtures of Taq and Pfu polymerase (for 
example, TaqPlus Long^". Stratagene; La Jolla, CA). Therefore, Pfu PEF is useful 
with polymerase mutants, truncated versions of polymerases, mixtures of 
polymerases, and polymerase-additive combinations (for example, Perfect l^atch®. 
Stratagene). 

2. Enhancement of Native Pfu DNA Polymerase with Pfu PEF 

Subsequent to identifying PEF from P. furiosus, we recognized that certain 
lots of native Pfu DNA polymerase preparations contained PEF. Varying amounts 
of the >250kD aggregate could be detected on silver-stained SDS-PAGE gels (e.g., 
lots 38 and 46 in Figure 24). Eleven of the 23 preparations examined were found to 
visibly contain low levels (8/11 lots; 0.1-1% total protein) to high levels (3/11 lots; 
10-30% total protein) of PEF. PEF co-migrates with Pfu DNA polymerase during 
the initial Q- and SP-S pharose columns in Pfu DNA polymerase purification, and 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCT/US98/05497 



-69- 

elutes just after the major peak of Pfu DNA polymerase activity on the Heparin 
Sepharose column (Figure 32). Pfu DNA polymerase fractions pooled after the 
Heparin Sepharose step are typically contaminated with varying amounts of PEF, 
depending upon the column efficiency and pooling strategy employed. 

Contamination of native Pfu DNA polymerase with varying amounts of PEF 
could potentially contribute to lot-to-lot variation in the performance of native Pfu 
DNA polymerase in PCR. It is expected that lots containing approximately 1-100 ni 
of PEF per 2.5U of Pfu DNA polymerase will give rise to higher PCR product yields 
than amplifications conducted with cloned Pfu DNA polymerase or native Pfu DNA 
polymerase lots contaminated with < 10 pg per 2.5U Pfu DNA polymerase (<.02 % 
total protein). In theory, a lot containing certain PEF concentrations would exhibit 
reduced Pfu DNA polymerase performance, based upon the apparent inhibition of 
PEF at high concentrations discussed below (>900 ng per 2.5U Pfu DNA 
polymerase in lOOpi PCRs). 

When adding PEF to native Pfu DNA polymerase PCR amplifications, it is 
anticipated that the level of PEF contained in a particular lot of native Pfu must be 
taken into account to avoid smearing, inhibition of synthesis, or sub-optimal 
enhancement. 

Example 15 
Use of PEFs in Amplification Reactions 
1. Activity of Pfu PEF in a Standard PCR Protocol 
To enhance PCR product yield. P. furiosus PEF is added, separately or pre- 
mixed with the DNA polymerase, to standard PCR amplifications. PCR 
amplification reactions generally consist of the following: .01-.05U/pl DNA 
polymerase. 2.5 pg/pi to 1 ng/pl P. furiosus PEF. 1-5 ng/pl of each primer, .5-10 
ng/yi of genomic DNA template, and 0.2mM each dNTP in a suitable buffer (e.g.. 
cloned Pfu DNA polymerase buffer consists of 10mM KCI. lOmM (NHJjSO^. 20mM 
Tris-HCI (pH 8.8). 2mM MgS04. 0.1%(v/v) Triton X-100. and 100ng/Ml BSA). 
Amplifications from less-complex templates (e.g.. lambda or plasmid DNA) are 
typically successful with 0,1-100 pg//il DNA. The minimum amounts of native PEF 
typically used in PCR correspond to amounts of enzyme sufficient to catalyze the 
release of 0.1 to 0.4 nmole PPi/hr per IOOmI PCR reaction (see example 11. section 
2). PCR amplifications are conducted under standard conditions; e.g., 94-96 "C for 
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0.5-2 min. (1 cycle)/ 94.96'C for 0.5-2 min.; 50-65°C for 0.5-2 min.; 68-72'C for 

0.5-3 min. per kb of target amplified (3040 cycles)/72X for 0-10 min. (1 cycle). 

2. Enhancement of PCR Amplification Reactions Employing 
DNA Polymerases Other than Pfu DNA Polymerase 

P. furiosus PEF has been found to enhance the performance of other a-type 

(Family B-related) DNA polymerases from thermophilic archea. Enhanced PCR 

product yields were observed when P. furiosus PEF was added to amplifications 

conducted with DNA polymerases from both Pyrococcus and Themococcus 

species. DNA polymerases demonstrated to function with P. furiosus PEF include: 

Pwo DNA polymerase (Boehringer Mannheim; cloned from P woesei). Deep Vent 

DNA polymerase (New England Biolabs; cloned from P. sp. GSD). JDF3 DNA 

polymerase (Stratagene; cloned from P sp. JDF31 ES4 DNA polymerase 

(Stratagene; purified from P sp. ES4, renamed P endeavori) and Vent DNA 

polymerase (New England Biolabs; cloned from 7. litoralis). 

In Figure 33, S200-purified Pfu PEF (prep. 1) increased yields of a 6.2kb 

PCR product obtained with 2.5U of Pwo DNA polymerase. In Figure 34, the 

addition of S200-purified PEF (prep, 3) to JDF3 DNA polymerase PCRs increased 

the yield of a 10kb product amplified from lambda DNA (lanes 5-7; 11-13) and 

mouse genomic DNA containing 40 copies (lanes 8-10) or 1 copy (lanes 14-16) of a 

lambda DNA transgene. In JDF3 DNA polymerase-based PCRs, amplifications are 

typically conducted with 1U of enzyme and extension times of 0.5 min./kb target. In 

Figure 35. the addition of 5 ng of S200-purified PEF (prep. 3) to ES4, JDF3, Pfu. 

and Vent DNA polymerase PCRs increased the yield of a 1.9kb product amplified 

from P. furiosus genomic DNA. In Figure 36. the addition of P. furiosus PEF was 

also shown to increase yields of a 2kb PCR pnDduct amplified with Deep Vent DNA 

polymerase from transgenic mouse genomic DNA. 

The addition of P furiosus PEF may not enhance the yield of PCR products 
generated with Taq DNA polymerase (Figure 34, lanes 1-4 and Figure 23, lanes 9- 
10). Taq DNA polymerase is a Pol l-like (Family A-retated) DNA polymerase 
isolated originally from the thermophilic eubacteria Thermus aquaticus. 

3. Enhancement of RT«PCR Reactions 

P. furiosus PEF has also been shown to enhance the yield of PCR products 
amplified from reverse transcribed RNA (cDNA) in a process known as RT-PCR, 
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known in the art Enhancement has been observed in both 2-step (Figure 37) and 
1-tube RT-PCR protocols (data not shown). In the former procedure, aliquots of 
cDNA synthesis reactions are added to PCR reactions containing a themiostable 
DNA polymerase (e.g.. Pfu DNA polymerase) and P. furiosus PEF. In the latter 
approach. RNA is added to reaction mixtures containing a thermolabile RT. dNTPs. 
primers, a thennostable DNA polymerase (Pfu DNA polymerase), and P. furiosus ' 
PEF. CDNA synthesis and PCR amplification take place sequentially, in the same 
tube, by conducting cDNA synthesis at 37.42-C. followed by PCR amplification at 
elevated temperatures. 

In the 2-step RT-PCR procedure. cDNA synthesis is first performed by 
combining the following reagents (SOpI final volume): Spg total RNA pre-annealed to 
300 ng of primer (oligo dT. random hexamers. or a gene-specific primer). 4mM 
each dNTP. 20U RNase block (optional), and SOU MMLV RT (or other RT) in buffer 
containing SOmM Tris-HCI (pH 8.3). 75mM KCI. 3mM MgCI^. and DEPC-treated 
water. cDNA synthesis is allowed to proceed for 1 hour at 37-42»C. After heat 
inactivation of the RT. Ipl of cDNA is added to a SOpI PCR reaction containing 5U 
Pfu DNA polymerase. .01-50 ng P. furiosus PEF. IpM of each primer, and 0.2mM 
each dNTP in buffer consisting of lOmM KCI, 10mM (NHJjSO,. 20mM Tris-HCI (pH 
8.8). 3mM MgS04. 0.1%(v/v) Triton X-IOO. and lOO/^g/ml BSA. PCR can be 
conducted using the following cycle conditions: 94'C 2 min.; 60"C 2 min.; 68'C 1 
min. per kb (1 cycle) and then 94'C 1 min.; 60»C Imin.; 68X Imin. per kb (40 
cycles). 

The enhancement of RT-PCR with P. furiosus PEF was evaluated using 
PCR primers designed to span at least one intron-exon junction in the EGF receptor 
gene. Two primer sets were used (antisense: 5'GAG-TTA-AAT-GCC-TAC-ACT- 
GTA-TCT (SEQ ID NO.: 29); sense: 5'CAG-GAC-TCA-GAA-GCT-GCT-ATC-GAA 
(SEQ ID NO.: 30)(1kb) or S'CTG-CAC-GTG-CCC-TGT-AGG-ATT-TGT (SEQ ID 
NO.: 31)(3kb)). which generate PCR products of Ikb or 3kb, as indicated, when 
amplification occurs from spliced RNA rather than contaminating DNA. 

The PEF concentration which gives optimal performance was determined by 
titrating PEF preparation 3 (S-200 purified) and preparation 4 (heparin sepharose 
fraction) in the 2-step RT-PCR procedure described here. With PEF preparation 4. 
significant increase in the yield of the Ikb product was observed when O.OOI-lpl 
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was added (10 pg-lO ng PEF) (Figure 37). Synthesis of the 3kb product was 
significantly enhanced when CMpI (MO ng PEF) of preparation 4 was added. 
With PEF preparation 3. significant increases in the yields of both the 0.6kb and the 
3kb products were observed for all amounts tested in the range of 0.002-0.1 pi (1-50 
ng). 

4. Enhancement of Seamless Cloning Protocol 

Seamless Cloning was performed using Stratagene's Seamless™ Cloning 
kit (Stratagene; La Jolla. CA. 1997/1998 Stratagene Catalog, specifically 
incorporated herein by reference), following the recommended protocol. The effect 
of P. furiosus PEF on the efficiency of Seamless Cloning is shown in Figure 38. 
Increased yield of a 7.2 kb PCR product was observed when 5 ng of S-200 purified 
PEF (prep. 1) was added to 50^1 PCR reactions containing 2.5U Pfu DNA 
polymerase and methyl dCTP. Amplifications conducted in the presence of PEF 
utilized 1 min. per kb extension times. In the absence of PEF, very little PCR 
product was generated despite the use of longer 2 min./kb extension times. 

5. Enhancement of Linear Amplification Reactions: 
QuIkChange™ Mutagenesis Protocol 

Site-specific mutagenesis can be accomplished efficiently with double- 
stranded DNA templates using a linear amplification-based strategy employing Pfu 
DNA polymerase (QuikChange™ Site-Directed Mutagenesis Kit; Stratagene; La 
Jolla. CA. 1997/1998 Stratagene Catalog, specifically incorporated herein by 
reference). PCR primers containing the desired mutation(s) are designed to anneal 
to the same site on opposite strands. Primer extension reactions are conducted 
with a thermostable DNA polymerase (e.g. Pfu DNA polymerase) at temperatures 
which allow efficient synthesis in the absence of strand displacement activity 
(ea'C). The amplification product is treated with Dpnl to digest the parental 
methylated plasmid DNA and the resulting gapped, double-stranded DNA is then 
transformed into competent £. co// cells, h/lutant clones are identified by DNA 
sequencing. 

In evaluating P. furiosus PEF, mutagenesis was conducted using 
Stratagene's Quik Change mutagenesis kit, except that both recombinant and 
native Pfu DNA polymerase were used in the kit-supplied reaction buffer. The 
effect of P. furiosus PEF on the efficiency of QuikChange mutagenesis is sh wn in 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCT/US98/05497 



-73- 

Figure 39. The addition of .04 to 4 ng of PEF of PEF prep. 2 (heparin sepharose 
fraction lot 36 H.S. #78; 40ng//il) to 50^1 reactions increased the number of 
transfomfiants generated by native and cloned Pfu DNA polymerases, while 
retaining mutation frequencies of 90-97%. Optimal results were obtained with 0.4 
ng of PEF. which gave 7.5-fold and 5.3-fold increases in the number of mutant 
colonies generated with native and cloned Pfu DNA polymerase, respectively. 

The use of PEFs in the QuikChange™ mutagenesis protocol corresponds to 
the use of PEFs in other linear amplification reactions known in the art. such as 
cycle sequencing reactions, primer extension reactions, and the like. PEFs can be 
employed in any linear amplification method to enhance the activity of the 
polymerase used. For example, the effect of Pfu PEF on cycle sequencing can be 
evaluated by comparing the quality and length of sequencing ladders generated 
with a polymerase, for example exo-Pfu DNA polymerase, in the absence and in the 
presence of PEF. A number of different cycle sequencing reactions, known to one 
skilled in the art. can be used in combination with the PEF complexes and proteins 
of this invention to enhance polymerase activity. In addition, primer extension 
reactions can also be enhanced with the use of PEFs. Numerous primer extension 
reactions are known in the art. 

Example 16 
Enhancing Titer of PEF 

The nucleic acid replication enhancing activity of several different 
preparations of Pfu PEF have been evaluated in PGR, PCR-related applications, 
linear amplification-based applications, mutagenesis applications, cycle sequencing 
applications, and primer extension applications. One skilled in the art will 
appreciate that similar methods to optimize the use of any PEF, such as those 
specifically discussed herein, are apparent from the disclosure herein. A sample of 
substantially homogeneous PEF (e.g. S200-purified) enhances the performance of 
Pfu DNA polymerase in PGR amplification reactions when added at concentrations 
spanning a 10.000-fold range (0.09-900 ng/IOOpI). The highest yields of amplified 
product are observed in the presence of =1 to 100 ng of P50. The addition of 
excess PEF (>900 ng/IOOpl. where protein concentration was determined by the 
silver-staining intensity of the P50 band as compared to known protein standards) 
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or very low PEF concentrations (< 9 pg/1 00/^1) in a PGR reaction resulted in 
lower PGR product yield. 

The relative purity and PEF content of 4 preparations was examined by 
SDS-PAGE analysis (Figure 10). Preparations 1 and 3 consist of S200-purified 
PEF of >95% homogeneity, while preparations 2 and 4 consist of concentrated 
heparin sepharose fractions of 10-20% homogeneity. 

The PGR enhancing titer of S200-purified P. furiosus PEF (prep. 1; Figures 
40 and 41) was detemnined using the F432-21/R6656-20/AAA742 primer-template 
system described in example 1. This preparation is approximately 95% pure, 
contains <.001U/mI DNA polymerase activity, and = 225 ng/pl PEF. PGR enhancing 
activity was found to be optimal when .004-.4mI (.9-90 ng) of homogeneous P. 
furiosus PEF was added to IOOmI PGR reactions containing 2.5U of Pfu DNA 
polymerase. Reduced, but significant, PGR product yield was observed when 4^1 
(900 ng) or .0004mI (.09 ng) of the S200-purified protein was added (Figure 40). In 
a second identical experiment, significant enhancement was noted when .004-4^1 
was added, and very little improvement was noted with .0004mI (Figure 41). 

The PGR enhancing titer of PEF preparation 2 was also determined (Figure 
42-43). Preparation 2 consisted of a concentrated (Microcon-30) heparin 
sepharose fraction, in which P. furiosus PEF made up approximately 10% of the 
total protein and was present at approximately 40 ng/pl. Enhanced PGR product 
yield was observed when .0002-.2mI (.008-8 ng) of the column fraction was added 
to lOOpI PGR reactions containing 2.5U of Pfu DNA polymerase, with greatest 
enhancements observed in the range of .002-.2jjl (.08-8 ng) (Figure 42). In a 
second identical experiment, enhanced PGR product yield was observed when 
.004-.04pl (.16-1.6 ng) of PEF preparation 2 was added to 2.5U of DNA polymerase 
per IOOmI PGR reaction (Figure 43). No PGR product was observed in the 
presence of 4mI (160 ng) or .0004pl (.016 ng) of the column fraction, while a smear 
was generated when .4pl (16 ng) of the column fraction was added to PGR. 
Smeary PGR products were also noted previously when Ipl of heparin sepharose 
fractions containing the highest concentrations of PEF are added to PGRs (e.g., lot 
37 H.S. fractions 69-81 in Figure 44; PEF prep. 2 in Figure 31). 

In summary, sunstantially homogeneous P. furiosus PEF enhances the 
perfomiance of Pfu DNA polymerase in test PGR amplifications when added at 
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concentrations spanning a 10.000.fold range (0.09-900 ng/lOOpI). The highest 
yields are observed in the presence of M to 100 ng of PEF. The addition of excess 
PEF (>900 ng/IOOMi) or PEF < 9 pg/IOOpI PGR reaction, was found to give reduced 
perfomnance (lower PGR product yield). Partially-purified PEF samples (heparin 
sepharose column fractions) also appear to enhance PGR product yield over a fairly 
broad range of PEF concentrations. With the column fraction analyzed here, 
highest yields of PGR were obtained in the range of .08 ng to 8 ng. The addition of 
higher amounts of the column fraction resulted in smearing (16-40 ng) or lack of 
enhancement (160 ng). 

Inhibition of PGR enhancement at high concentrations of PEF appears to 
occur irespective of the purity of the PEF sample. The addition of higher 
concentrations of homogeneous PEF (>900 ng) resulted in lower yields of PGR 
product than could be obtained with <900 ng PEF. Heparin sepharose fractions of 
10-20% purity also gave reduced PGR product yields when high amounts of PEF 
were added. Up to 8 ng of PEF in prep. 2 (H.S. #78 fraction) could be added before 
smearing or inhibition occun-ed. The discrepancy between the amount of PEF 
which is inhibitory in homogeneous preparations (>900 ng). as compared to 
partially-purified column fractions (>16 ng), suggests that additional protein or DNA 
contaminants may be present in the heparin sepharose fractions. 

Examination of heparin sepharose fractions revealed that P. furiosus PEF 
elutes just after the major peak of Pfu DNA polymerase activity (e.g., fractions 50- 
61 in Figure 32). SDS-PAGE analysis showed that the highest levels of the >250kD 
PEF appeared in fractions 63-78 from the native Pfu DNA polymerase purification 
(Figure 32). PGR enhancing activity was observed in fractions 37-90 (Figure 44). 
Discrete PGR product bands were generated with fractions 37-48 and 87-90. which 
contain very low levels of PEF. Interestingly, DNA smears were generated with 
fractions 69-81 (Figure 44), which contain the highest levels of PEF and no 
detectable Pfu DNA polymerase. These results are consistent with the hypothesis 
that P. furiosus PEF acts as an inhibitory substance when present at high 
concentrations. However, concentrations high enough to adversely effect 
polymerization and PGR reactions can easily be determined through the methods 
and assays described herein. Thus, one skilled in the art can avoid concentrations 
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of PEF that inhibit polymerase activity or PCR reactions without undue 
experimentation. 

As observed with PCR. inhibition during linear amplification protocols was 
noted with high concentrations of PEF-containing heparin sepharose fractions 
(Figure 39). The addition of 40 ng (VI prep. #2) of PEF to QuickChange'** 
reactions resulted in reduced yield of amplification product, as visualized by a 
reduction in the ethidium bromide-staining intensity of DNA bands on agarose gels. 
Reduced yield accompanied a 1.8 to 2.8-fold reduction in the number of 
transfomiants and a slight, but reprodudble. decrease in mutation frequency. 

A possible and the most likely explanation for inhibition by homogenous PEF 
preparations is depletion of dCTP. In Example 1 1 , section 1 , we demonstrated that 
PEF can utilize dCTP as a substrate, although much less efficiently than dUTP. At 
high PEF concentrations it is possible that enough dCTP is hydrolyzed by PEF to 
drop the dCTP levels below what is required for optimal DNA synthesis. It is also 
possible that moderate to high levels of PEF could alter dCTP levels enough to 
affect DNA polymerase misincorporation rates. Alternatively, contaminants in the 
substantially homogenous PEF preparations may also cause the inhibition and may 
only be present in sufficient concentrations when high concentrations of PEF are 
used. 

Each of the references referred to herein can be relied on by one skilled in 
the art in making and using embodiments of the invention. In addition, each 
reference is specifically incorporated, in its entirety, into this disclosure. 

The sequence listing information that follows incorporates the sequences in 
prior U.S. Patent application serial no. 08/822,744, which is specifically incorporated 
herein by reference. The sequence information from any one sequence or any 
combination of sequences can be translated into a computer readable medium by 
those of skill in the art. Furthermore, the sequences of the specific clones or 
plasmids described or identified herein can be easily detennined and used in a 
computer readable medium by one skilled in the art. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT(S) : HOOREFE, Holly 
HANSEN, Connie J 
(iil TITLE OF INVENTION: Polymerase Enhancing Factor (PEF) 

Extracts, PEF Protein Complexes, 
Isolated PEF Proteins, and Methods for 
Purifying and Identifying Them 

(iii) NUMBER OF SEQUENCES: 89 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: M. Paul Barker, Esq., Finnegan, 
Henderson, Farabow, Garrett & Dunner, LLP 

(B) STREET: 1300 I. Street, N.W. 

(C) CITY: Washington 

(D) STATE: DC 
(F) ZIP: 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: NOT YET KNOWN 

(B) FILING DATE: 20-MARCH-1998 

(C) CLASSIFICATION: 
(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: BARKER, M. PAUL 

(B) REGISTRATION NUMBER: 32,013 

(C) REFERENCE/DOCKET NUMBER: 4121.0116.02304 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 408-4000 

(B) TELEFAX: 202 4 08-4400 
(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Xaa Xaa Leu His His Val Lys Leu lie Tyr Ala Thr Xaa Xaa Xaa 
1 5 10 15 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Xaa 
^ 10 15 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Leu Leu His His Val Lys Leu He Tyr Ala Thr Lys Xaa Arg Xaa 
^5 10 15 

Leu Val Gly Lys Xaa He Val Leu Ala He Pro Gly Xaa Xaa Ala Xaa 
20 25 30 

Xaa Xaa Xaa 
35 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Glu Xaa 
^ S 10 15 

Xaa Xaa 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Tyr Asp Ala Val He Met Ala Ala Ala Val Val Asp Phe Arg Pro 
^5 10 15 

Lys 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptid 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Asp Leu Val Val Gly Asn Thr Leu'ciu Ala Phe Gly Ser Glu Glu 

^ 10 15 

Aan Gin Val Val Leu He Gly Arg 
20 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: untaown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

Gly Ala Met Leu His His Val Lys Leu 'lie Tyr Ala Xaa Lys Leu Arg 
1 5 10 15 

Lye 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Ala Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Xaa Xaa 
15 10 15 

Arg Lys 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-tenninal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Lys Phe Arg Lys Glu Glu Ser 
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^5 10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Ala He Leu Leu Pro Asp Trp Lya He Arg Lys Glu He Leu He 
15 10 15 

Glu 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(V) FRAQIENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Met His His Val He Lys Leu Xaa Tyr Ala Thr Xaa Ser Arg Lys 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
(B| TYPE: amino acid 
(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Tyr Leu Val Arg Pro Asp Trp Lys Arg Arg Lys Glu He Leu 
15 10 15 

He Glu 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAYCAYGAHA ARYTHATTTA CGC 23 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCCATDATNA CDGCRTCGTA TTT 
(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAYCAYGAHA ARYTHATATA CGC 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ARDACDACYT GRTTTTCTTC 2o 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGCTTCACC ACGTCAAGCT AATCTACGCC ACAAAAAGTC " GAAAGCTAGT TGGAAAAAAG 60 
ATAGTCNNNN NNNNNCCAGG GAGTATTGCG GCTTTGGATG TGAAAGCTTG TGAGGGACTA 120 
ATTAGGCATG GGGCCGAAGT TCATGCAGTG ATGAGTGAGG CAGCCACCAA GATAATTCAT 180 
CCTTATGCAT GGAATTTGCC CACGGQAAAT CCAGTCATAA CTGAGATCAC TGGATTTATC 240 
GAGCATGTTG AGTTAGCAGG GGAACATGAG AATAAAGCAG ATTTAATTTT GGTTTGTCCT 300 
GCCACTGCCA ACACAATTAG TAAGATTGCA TGTGGAATAG ATGATACTCC AGTAACTACA 360 
GTCGTGACCA CAGCATTTCC CCACATTCCA ATTATGATAG CCCCAGCAAT GCATGAGACA 420 
ATGTACAGGC ATCCCATAGT AAGGGAGAAC ATTGAAAGGT TAAAGAAGCT TGGCGTTGAG 480 
TTTATAGGAC CAAGAATTGA GGAGGGAAAG GCAAAAGTTG CAAGCATTGA TGAAATAGTT 54 0 
TACAGAGTTA TTAAAAAGCT CCACAAAAAA ACATTGGAAG GGAAGAGAGT CCTAGTAACG 600 
GCGGGAGCAA CAAGAGAGTA CATAGATCCA ATAAGATTCA TAACAAATGC CAGCAGTGGA 660 
AAAATGGGAG TAGCGTTGGC TGAAGAAGCA GATTTTAGAG GAGCTGTTAC CCTCATAAGA 720 
ACAAAGGGAA GTGTAAAGGC TTTTAGAATC AGAAAAATCA AATTGAAGGT TGAGACAGTG 780 
GAAGAAATGC TTTCAGCGAT TGAAAATGAG TTGAGGAGTA AAAAGTATGA CGTAGTTATT 840 
ATGGCAGCTG CTGTAAGCGA TTTTAGGCCA AAAATTAAAG CAGAGGGAAA AATTAAAAGC 900 
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GGAAGATCAA TAACC3ATAGA GCTCGTTCCN NNNAATCCCA AAATCATTGA TAGAATAAAG 960 
GAAATTCAAC CAAATGTCTT TCTTGTTGGA rTTAAAGCAG AAACTTCAAA AGAAAAGCTT 1020 
ATAGAAQAAG GTAAAAGGCA GATTGAGAGG GCCAAGGCTG ACTTAGTCGT TGGTAACACA 1080 
TTGGAAGCCT TTGGAAGCGA GOAAAACCAA GTAGTATTAA TTGGCAGAGA TTTCACAAAA 1140 
GAACTTCCAA AAATGAAAAA GAGAGAGTTA GCAGAGAGAA TTTGGGATGA GATAGAGAAA 1200 
TTNCTGTCC ^^qo 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Ser Arg Lya Leu 

15 10 15 

Val Gly Lys Lys lie Val Xaa Xaa Xaa Pro Gly Ser He Ala Ala Leu 

20 25 30 

Asp Val Lys Ala Cys Glu Gly Leu He Arg His Gly Ala Glu Val His 

35 40 45 

Ala Val Met Ser Glu Ala Ala Thr Lys He He His Pro Tyr Ala Trp 

50 55 60 

Asn Leu Pro Thr Gly Asn Pro Val He Thr Glu He Thr Gly Phe He 
« 70 75 BO 

Glu His Val Glu Leu Ala Gly Glu His Glu Asn Lys Ala Asp Leu He 

as 90 95 

Leu Val Cys Pro Ala Thr Ala Asn Thr He Ser Lys He Ala Cys Gly 

100 105 110 

He Asp Asp Thr Pro Val Thr Thr Val Val Thr Thr Ala Phe Pro His 

lis 120 125 

He Pro He Met He Ala Pro Ala Met His Glu Thr Met Tyr Arg His 

130 135 140 

Pro He Val Arg Glu Asn He Glu Arg Leu Lys Lvs Leu Gly Val Glu 
145 150 155 ' 160 

Phe He Gly Pro Arg He Glu Glu Gly Arg Ala Lys Val Ala Ser He 

165 170 175 

Asp Glu He Val Tyr Arg Val He Lys Lys Leu His Lys Lys Thr Leu 

180 185 190 

Glu Gly Lys Arg Val Leu Val Thr Ala Gly Ala Thr Arg Glu Tyr He 

195 200 205 

Asp Pro He Arg Phe He Thr Asn Ala Ser Ser Gly Lys Met Gly Val 

210 215 220 

Ala Leu Ala Glu Glu Ala Asp Phe Arg Gly Ala Val Thr Leu He Arg 
225 230 235 240 

Thr Lys Gly Ser Val Lys Ala Phe Arg He Arg Lys He Lys Leu Lys 

245 250 255 

Val Glu Thr Val Glu Glu Met Leu Ser Ala He Glu Asn Glu Leu Arg 

260 265 270 

Ser Lys Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe 

275 280 285 

Arg Pro Lys He Lys Ala Glu Gly Lys He Lys Ser Gly Arg Ser He 

290 295 300 

Thr He Glu Leu Val Pro Xaa Asn Pro Lys He He Asp Arg He Lys 
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310 320 
Glu He Gin Pro Asn Val Phe heu Val Gly Phe Lye Ala Glu Thr Ser 

325 330 335 

Lye Glu Lye Leu He Glu Glu Gly Lye Arg Gin He Glu Arg Ala Lys 

345 350 
Ala ABp Leu Val Val Gly Aen Thr Leu Glu Ala Phe Gly Ser Glu Glu 

360 365 

Aen Gin Val Val Leu He Gly Arg Asp Phe Thr Lys Glu Leu Pro Lys 

370 375 3ao 

Met Lys Lys Arg Glu Leu Ala Glu Arg He Trp Asp Glu He Glu Lys 
390 395 

Xaa Leu Ser 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CATAGCGAAT TCGCAAAACC TTTCGCGGTA TGG 33 
(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ACTACGGAAT TCCACGGAAA ATGCCGCTCA TCC 33 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGCGTTTCCG TTCTTCTTCG 20 
(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCT/US98/05497 



-84. 

(iv) ANTI -SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:23- 
CCATCTCACG CGCCAGTTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unJcnown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 
(ii) MOLE<OT-E TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 26: 
Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJmown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GAGCTTGCTC AACTTTATC 
(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GATAGAGATA GTTTCTGGAG ACG 
(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 
(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CGGGATATCO ACATTTCT6C ACC 
(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENOTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GAGTTAAATG CCTACACTGT ATCT ' ' 24 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30; 
CAGGACTCAG AAGCTGCTAT CGAA 24 
(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CTGCACGTGC CCTGTAGGAT TTGT 24 
(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCAGAYTGGA ARWKNAGGAA AGA 23 
(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CCAQAYTGGA ARWKNAGAAA AGA 23 
(2) INFORMATION FOR SEQ ID NO:34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknovoi 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCAGAYTGGA ARWKNAGGAA GGA 23 
(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unicnown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCAGAYTGGA ARWKNAGAAA GGA 2 3 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CAGAGTGGGC AGAGAGGCTN TTGTTAAGGG GAAATTAATC GACGTGGAAA 
AGGAAGGAAA 60 
AGTCGNTATT CCTCCAAGGG AATA 84 
(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Glu Trp Ala Glu Arg Leu Leu Leu Arg Gly Aen Xaa Ser Lys Trp Lys 
15 10 15 
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Arg Lys Glu Lye Ser Xaa Phe Leu Gin Gly Asn 
20 25 
(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Arg Val Gly Arg Glu Ala Xaa Val Lys Gly Lys Leu He Glu Val Glu 
15 10 15 

Lys Glu Gly Lys Val Xaa He Pro Pro Arg Glu 
20 25 
(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Gin Ser Gly Gin Arg Gly Xaa Cys Xaa Gly Glu He Asn Arg Ser Gly 
15 10 15 

Lys Gly Arg Lys Ser Arg Tyr Ser Ser Lys Gly Leu 
20 25 
(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unimown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTGCCCACTC TGAGGTCATA ACCTGCTGGT TGGAGCCATT CTTCAGAAAA TGGCTCTATA 60 
AGTATTTCTT TTCTGATTTT CCAGTCTGGA AGTAGCATTT TACCACCGAA ACCTTTATTT 120 
TTAATTTAA 3^29 
(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Xaa He Lys Asn Lys Gly Phe Gly Gly Lys Met Leu Leu Pro Asp Trp 
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15 10 15 

Lye He Arg Lys Glu He Leu He Glu Pro Phe Ser Glu Glu Trp Leu 

25 30 
Gin Pro Ala Gly Tyr Asp Leu Arg Val Gly 
35 40 
(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 
(B| TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TCCTCCAAGG GAATACGCCT TAATCCTAAC CCTCGAGAGG ATAAAGTTGC CCGACGATGT 60 
TATGGGGGAT ATGAAGATAA GGAGCAGTTT AGCAAGAGAA GGGGTTATTG GTTCnTTGC 120 
TTGGGTTGAC CCAGGATGGG ATGGAAACTT AACACTAATG CTCTACAATG CCTCAAATGA 180 
ACCTGTCGAA TTAAGATATG GAGAGAGATT TGTGCAGATC GCATTTATAA GGCTAGAGGG 240 
TCCGGCAAGA AACCCTTACA GAGGAAACTA TCAGGGGAGC ACAAGGTTAG CGTTTTCAAA 300 
GAGAAAGAAA CTCTAGCGTC TTTTCAATAG CATCCTCAAT ATCTCGTGTG AAGTAATCAA 360 
TGTAAATACT TGCT GGGTGG GTTTTTAGGG ATTCAAACTC GTAAGATGGG CCTGTATAGC 420 
AGAAAACTAT TTTTGCCTCT TCTTCATTTA TCTTTCTGTG AATAAAAAAT CCAACATCCA 480 
CACTAGTTCC AAAAGATATT GTTTGCGTGA TTACCAACAA GATCTTGGCA TTATTTTTGA 540 
TCTTATACTC TA TTCTC CTT TCTCCCTCCA ATTTGCCCAA AATAAACCTG GGTAGTATAC 600 
ATTCACTCCT CTCTTTTAAA TTCCTATAAA TTCGTACATA GTTTAGAAAA ATGTCAAATT 660 
CTTTNTTCCC TGTTAAATTA ACCNCNAAAT CTTTAINANN AANCTTTTTA TAATTCCCAA 720 
AACCCCTAAT TTTCCCCTTN 740 
(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) 


MOLECULE TYPE: peptide 










(iii) 


HYPOTHETICAL: YES 










(iv) 


ANTI- SENSE: NO 










(V) 


FRAGMENT TYPE: N- terminal 








(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


:43: 




Leu 


Gin Gly Asn Thr Pro Xaa 


Ser 


Xaa 


Pro Ser Arg Gly Xaa 


Ser Cys 


1 


5 






10 


15 


Pro 


Thr Met Leu Trp Gly He 


Xaa 


Arg 


Xaa Gly Ala Val Xaa 


Gin Glu 




20 




25 


30 




Lys 


Gly Leu Leu Val Leu Leu 


Leu 


Gly 


Leu Thr Gin Asp Gly 


Met Glu 




35 


40 




45 




Thr 


Xaa His Xaa Cys Ser Thr 


Met 


Pro 


Gin Met Asn Leu Ser 


Asn Xaa 




50 55 






60 




Asp 


Met Glu Arg Asp Leu Cys 


Arg 


Ser 


His Leu Xaa Gly Xaa 


Arg Val 


65 


70 






75 


80 


Arg 


Gin Glu Thr Leu Thr Glu 


Glu 


Thr 


He Arg Gly Ala Gin 


Gly Xaa 




85 






90 


95 


Arg 


Phe Gin Arg Glu Arg Asn 


Ser 


Ser 


Val Phe Ser He Ala 


Ser Ser 




100 




105 


110 




He 


Ser Arg Val Lys Xaa Ser 


Met 


Xaa 


He Leu Ala Gly Trp 


Val Phe 
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115 120 125 

Arg Asp Ser Asn Ser Xaa Asp Gly Pro Val Xaa Gin Lys Thr He Phe 

"0 135 140 

Ala Ser Ser Ser Phe He Phe Leu Xaa He Lys Asn Pro Thr Ser Thr 

150 155 
Leu Val Pro Lys Asp He Val Cys Val He Thr Asn Lys He Leu Ala 

165 170 
Leu Phe Leu He Leu Tyr Ser He Leu Leu Ser Pro Ser Asn Leu Pro 

IBO 185 190 

Lys He Asn Leu Gly Ser He His Ser Leu Leu Ser Phe Lys Phe Leu 

155 200 205 

Xaa He Arg Thr Xaa Phe Arg Lys Met Ser Asn Ser Xaa Phe Pro Val 

210 215 220 

Lys Leu Thr Xaa Lys Ser Leu Xaa Xaa Xaa Phe Leu Xaa Phe Pro Lys 
225 230 235 240 

Pro Leu He Phe Pro Xaa 
245 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Pro Arg Glu Tyr Ala Leu He Leu Thr Leu Glu Arg He Lys Leu 

15 10 15 

Pro Asn Asn Val Met Gly Asp Met Lys He Arg Ser Ser Leu Ala Arg 

20 25 30 

Glu Gly Val He Gly Ser Phe Ala Trp Val Asp Pro Gly Trp Asp Gly 

35 40 45 

Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu Pro Val Glu Leu 

50 55 60 

Arg Tyr Gly Glu Arg Phe Val Gin He Ala Phe He Arg Leu Glu Gly 
" 70 75 80 

Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gin Gly Ser Thr Arg Leu 

85 90 95 

Ala Phe Ser Lys Arg Lys Lys Leu Xaa Arg Leu Phe Asn Ser He Leu 

100 105 110 

Asn He Ser Cys Glu Val He Asn Val Asn Thr Cys Trp Val Gly Phe 

115 120 125 

Xaa Gly Phe Lys Leu Val Arg Trp Ala Cys He Ala Glu Asn Tyr Phe 

130 135 140 

Cys Leu Phe Phe He Tyr Leu Ser Val Asn Lys Lys Ser Asn He His 
145 150 155 160 

Thr Ser Ser Lys Arg Tyr Cys Leu Arg Asp Tyr Gin Gin Asp Leu Gly 

165 170 175 

He He Phe Asp Leu He Leu Tyr Ser Pro Phe Ser Leu Gin Phe Ala 

IBO 185 190 

Gin Asn Lys Pro Gly Xaa Tyr Thr Phe Thr Pro Leu Phe Xaa He Pro 

195 200 205 

He Asn Ser Tyr II Val Xaa Lys Asn Val Lys Phe Phe Xaa Pro Cys 
210 215 220 
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Xaa lit Asn Xaa Xaa He Phe Xaa Xaa Xaa Leu Phe He He Pro Lys 
230 

Thr Pro Asn Phe Pro Leu 
245 

(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Ser Ser Lya Gly He Arg Leu Aen Pro Asn Pro Arg Glu Asp Lys Val 

1 5 10 15 

Ala Arg Arg Cys Tyr Gly Gly Tyr Glu Asp Lys Glu Gin Phe Ser Lys 

, 20 25 30 

Arg Arg Gly Tyr Trp Phe Phe Cys Leu Gly Xaa Pro Arg Met Gly Trp 

35 40 45 

Lys Leu Asn Thr Asn Ala Leu Gin Cys Leu Lys Xaa Thr Cys Arg He 

50 55 60 

Lys He Trp Arg Glu He Cys Ala Asp Arg He Tyr Lys Ala Arg Gly 
^5 70 75 80 

Ser Gly Lys Lys Pro Leu Gin Arg Lys Leu Ser Gly Glu His Lys Val 

85 90 95 

Ser Val Phe Lys Glu Lys Glu Thr Leu Ala Ser Phe Gin Xaa His Pro 

100 105 110 

Gin Tyr Leu Val Xaa Ser Asn Gin Cys Lys Tyr Leu Leu Gly Gly Phe 

lis 120 125 

Leu Gly He Gin Thr Arg Lys Met Gly Leu Tyr Ser Arg Lys Leu Phe 

130 135 140 

Leu Pro Leu Leu His Leu Ser Phe Cys Glu Xaa Lys He Gin His Pro 
150 155 160 

Hxs Xaa Phe Gin Lys He Leu Phe Ala Xaa Leu Pro Thr Arg Ser Trp 

165 170 175 

His Tyr Phe Xaa Ser Tyr Thr Leu Phe Ser Phe Leu Pro Pro He Cys 

160 185 190 

Pro Lys Xaa Thr Trp Val Val Tyr He His Ser Ser Leu Leu Asn Ser 

155 200 205 

Tyr Lys Phe Val His Ser Leu Glu Lys Cys Gin He Leu Xaa Ser Leu 

210 215 220 

Leu Asn Xaa Pro Xaa Asn Leu Tyr Xaa Xaa Xaa Phe Tyr Asn Ser Gin 
225 230 235 240 

Asn Pro Xaa Phe Ser Pro 
245 

(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 



SUBSTITUTE SHEET (RULE 26) 



wo 98/42860 



PCT/US98/05497 



-91. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Leu HiB His Val Lye Leu He Tyr Ala Thr Lys Ser Arg Lys Leu 
^5 10 15 

Val Gly Lys Lys He Val Xaa Xaa Xaa Pro Gly Ser Ue Ala Ala 

25 30 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe Arg Phe 

I 2 10 15 

Lys 

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
15 10 15 

Asn Gin Val Val Leu He Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTATTGAGTA CGAACGCCAT C 21 
(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Xi> SEQUENCE DESCRIPTION: SEQ ID N0:50: 
GTCACGCTTO CTCCACTCCG 2q 
(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 437 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanococcus Jannaschii 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met He Ser Glu He Met His Pro Thr Lye Leu Leu Lys Gly Thr Lys 

15 10 15 

Ser Lys Leu Leu Glu Asn Lys Lys He Leu Val Ala Val Thr Ser Ser 

20 25 30 

He Ala Ala He Glu Thr Pro Lys Leu Met Arg Glu Leu He Arg His 

35 40 45 

Gly Ala Glu Val Tyr Cys He He Thr Glu Glu Thr Lys Lys He He 

50 55 60 

Gly Lys Glu Ala Leu Lys Phe Gly Cys Gly Asn Glu Val Tyr Glu Glu 
" 70 75 80 

He Thr Gly Xaa Xaa Xaa Xaa Xaa Asp He Glu His He Leu Leu Tyr 

85 90 95 

Xaa Xaa Xaa Xaa Asn Glu Cys Asp Cys Leu Leu He Tyr Pro Ala Thr 

100 105 110 

Ala Asn He He Ser Lys He Asn Leu Gly He Ala Asp Asn He Val 

H5 120 125 

Asn Thr Thr Ala Leu Met Phe Phe Gly Asn Lys Pro He Phe He Val 

130 135 
Pro Ala Met His Glu Asn Met Phe Asn Xaa Xaa Ala He Lys Arg His 
145 150 155 160 

He Asp Lys Leu Lys Glu Lys Asp Lys He Tyr He He Ser Pro Lys 

165 170 175 

Phe Glu Glu Xaa Xaa Xaa Xaa Xaa Xaa Gly Lys Ala Lys Val Ala Asn 

180 185 190 

He Glu Asp Val Val Lys Ala Val He Glu Lys He Gly Asn Asn Leu 

195 200 205 

Lys Lys Glu Gly Asn Arg Val Leu He Leu Asn Gly Gly Thr Val Glu 

210 215 220 

Phe He Asp Lys Val Arg Val He Ser Asn Leu Ser Ser Gly Lys Met 
225 230 235 240 

Gly Val Ala Leu Ala Glu Ala Phe Cys Lys Glu Gly Phe Tyr Val Glu 

245 250 255 

Val He Thr Ala Met Gly Leu Glu Pro Pro Tyr Tyr He Lys Asn His 

260 265 270 

Lys Val Leu Thr ' Ala Lys Glu Met Leu Asn Lys Ala He Glu Xaa Xaa 

275 280 285 

Leu Xaa Ala Lys Asp Phe Asp He He He Ser Ser Ala Ala He Ser 

290 295 300 

Asp Phe Thr Val Glu Ser Xaa Phe Glu Gly Lys Leu Ser Ser Glu Glu 
^05 310 315 320 

Glu Xaa Xaa Xaa Xaa Leu He Leu Lys Leu Lys Arg Xaa Asn Pro Lys 
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325 330 
Val Leu Glu Glu Leu Arg Arg He Tyr Lys Asp Xaa Lys Val lie He 

340 345 350 

Gly Phe Lys Ala Glu Tyr Asn Leu Asp Glu Lys Glu Leu He Asn Arg 

355 360 365 

Ala Lys Glu Arg Leu Asn Lys Tyr Asn Leu Asn Met He He Ala Asn 

370 375 380 

Asp Leu Ser Lys Xaa Xaa His Tyr Phe Gly Asp Asp Tyr He Glu Val 
390 395 400 

Tyr He He Thr Lys Tyr Glu Val Glu Lys He Ser Gly Ser Lys Lys 

405 410 43^5 

Xaa Glu He Ser Glu Arg He Val Glu Lys Val Lys Lys Leu Val Lys 

420 425 430 

Ser Xaa Xaa Xaa Xaa 
435 

(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Ala Arg Gin Gin Lys Tyr Cys Asp Lys He Ala Asn Phe Trp 

15 10 15 

Cys His Pro Thr Gly Lys He He Met Ser Leu Ala Gly Lys Lys He 

20 25 30 

Val Leu Gly Val Ser Gly Gly He Ala Ala Tyr Lys Thr Pro Glu Leu 

35 40 45 

Val Arg Arg Leu Arg Asp Arg Gly Ala Asp Val Arg Val Ala Met Thr 

50 55 60 

Glu Ala Ala Lys Ala Phe He Thr Pro Leu Ser Leu Gin Ala Val Ser 
€5 70 75 80 

Gly Tyr Pro Val Ser Asp Ser Leu Leu Asp Pro Ala Ala Glu Ala Ala 

85 90 95 

Met Gly His He Glu Leu Gly Xaa Xaa Xaa Xaa Lys Trp Ala Asp Leu 

100 105 lie 

Val He Leu Ala Pro Ala Thr Ala Asp Leu He Ala Arg Val Ala Ala 

lis 120 125 

Gly Met Ala Asn Asp Leu Val Ser Thr He Cys Leu Ala Thr Pro Xaa 

130 135 140 

Xaa Ala Pro Val Ala Val Leu Pro Ala Met Asn Gin Gin Met Tyr Arg 
145 150 155 160 

Ala Ala Ala Thr Gin His Asn Leu Glu Val Leu Ala Xaa Ser Arg Gly 

165 170 175 

Leu Leu He Trp Gly Pro Asp Ser Gly Ser Gin Ala Cys Gly Asp He 

180 185 190 

Gly Pro Gly Arg Xaa Xaa Asp Pro Leu Thr He Val Asp Met Ala Val 

195 200 205 

Ala His Phe S r Pro Val Asn Asp Leu Lys His Leu Asn He Met He 
210 215 220 
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Thr Ala Gly Pro Thr Arg Glu Pro Leu Asp Pro Val Arg Tyr lie Ser 
"5 230 235 240 

Asn His Ser Ser Gly Lys Met Gly Phe Ala He Ala Ala Ala Ala Ala 

24S 250 255 

Arg Arg Gly Ala Asn Val Thr Leu Val Ser Gly Pro Val Ser Leu Pro 

260 265 270 

Thr Pro Pro Phe Val Lys Arg Val Asp Val Met Thr Ala Leu Glu Met 

275 280 285 

Glu Ala Ala Val Asn Xaa Xaa Ala Ser Val Gin Gin Gin Asn He Phe 

290 295 300 

He Gly Cys Ala Ala Val Ala Asp Tyr Arg Ala Ala Thr Val Ala Pro 
310 315 320 

Glu Lys He Lys Lys Gin Ala Thr Gin Gly Asp Glu Leu Thr He Lys 

325 330 335 

Met Val Lys Xaa Asn Pro Asp He Val Ala Gly Val Ala Ala Leu Lys 

340 345 350 

Asp His Arg Pro Tyr Val Val Gly Phe Ala Ala Glu Thr Asn Asn Xaa 

355 360 365 

Xaa xaa Xaa Val Glu Glu Tyr Ala Arg Gin Lys Arg He Arg Lys Asn 

370 375 380 

Leu Asp Leu He Cys Ala Asn Asp Val Ser Gin Pro Thr Gin Gly Phe 
390 395 400 

Asn Ser Asp Asn Asn Ala Leu His Leu Phe Trp Gin Asp Gly Asp Lys 

405 410 415 

Val Leu Pro Leu Glu Arg Lys Glu Leu Leu Gly Gin Leu Leu Leu Asp 

420 425 430 

Glu He Val Thr Arg Tyr Asp Glu Lys Asn Arg Arg 
435 440 
(2) INFORMATION FOR SEQ ID NO: S3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
Ala Gly Trp He Asp Ala Gly Phe Lys'ciy Lys He Thr Leu 
Is 10 
(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 
(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
Pro Thr He Val Asp Ala Gly Phe Glu Gly Gin Leu Thr He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
Ala His Arg He Asp Pro Gly Trp Ser Gly Cys He Val Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:59; 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: un)cnown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
CAGGACTCAG AAGCTGCTAT CGAA 
(2) INFORMATION FOR SEQ ID N0:61: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI "SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CTGCACGTGC CCTGTAGGAT TTGT 
2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTATTGAGTA CGAACGCCAT C 
(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GTCACGCTTG CTCCACTCCG 20 
(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singl 

(D) TOPOLOGY: untaown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GAGGA6A6CA GGAAAOGTGa AAC 
(2) INFORMATION FOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
His His Val Lys Leu He Tyr Ala 
1 5 
(2) INFORMATION FOR SEQ ID NO: 67: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
Lys Tyr Asp Ala Val He Met Ala 
1 5 
(2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
Glu Glu Asn Gin Val Val Leu 
1 5 
(2) INFORMATION FOR SEQ ID NO: 69: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
Pro Asp Trp Lys He Arg Lys Glu 
1 S 
(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
ATGCTACTTC CAGACTG6AA AATCAGAAAA GAAATACTTA TAGAGCCATT TTCTGAAGAA 60 
TCGCTCCAAC CAGCAGGTTA TGACCTCAGA GTGGGCAGAG AGGCTTTTGT TAAGGGGAAA 120 
TTAATCGACG TGGAAAAGGA AGGAAAAGTC GTTATTCCTC CAAGGGAATA CGCCTTAATC 180 
CTAACCCTCG AGAGGATAAA GTTGCCCGAC GATGTTATGG GGGATATGAA GATAAGGAGC 240 
AGTTTAGCAA GAGAAGGGGT TATTGGTTCT TTTGCTTGGG TTGACCCAGG ATGGGATGGA 300 
AACTTAACAC TAATGCTCTA CAATGCCTCA AATGAACCTG TCGAATTAAG ATATGGAGAG 360 
AGATTTGTGC AGATCGCATT TATAAGGCTA GAGGGTCCGG CAAGAAACCC TTACAGAGGA 420 
AACTATCAGG GGAGCACAAG GTTAGCGTTT TCAAAGAGAA AGAAACTCTA G 471 
(2) INFORMATION FOR SEQ ID NO: 71: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(iii) HYPOIHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:71: 



Met 


Leu Leu 


Pro 


Asp 


Trp 


Lys 


He 


Arg 


Lys 


Glu 


He Leu 


He 


Glu Pro 


1 






5 










10 








15 


Phe 


Ser Glu 


Glu 


Ser 


Leu 


Gin 


Pro 


Ala 


Gly 


Tyr 


Asp Leu 


Arg 


Val Gly 






20 










25 








30 


Arg 


Glu Ala 


Phe 


val 


Lys 


Gly 


Lys 


Leu 


He 


Asp 


Val Glu 


Lys 


Glu Gly 




35 










40 








45 


Lys 


Val Val 
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Pro Val Glu Leu Arg Tyr Gly Glu Arg Phe Val Gin He Ala Phe He 

lis 120 125 

Arg Leu Glu Gly Pro Ala Arg Aan Pro Tyr Arg Gly Asn Tyr Gin Gly 

130 135 140 

Ser Thr Arg Leu Ala Phe Ser Lys Arg Lys Lys Leu 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unJcnown 

(D) TOPOLOGY: unicnown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:72: 
Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 73: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 74: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
Ala Gly Trp He Asp Ala Gly Phe Lys Gly Lys He Thr Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



SUBSnrUTE sheet (rule 26) 



wo 98/42860 



PCT/US98/05497 



-100- 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:75: 
Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : untaown 

(D) TOPOLOGY: un)cnown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
Pro Thr lie Val Asp Ala Gly Phe Glu Gly Gin Leu Thr He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unicnown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
Ala His Arg He Asp Pro Gly Trp Ser Gly Cys He Val Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
(5) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
Val Gly Leu He Asp Ser Asp Tyr Gin Gly Gin Leu Met He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 
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(V) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
Ala Gly Val Val Aep Arg Asp Tyr Thr Gly Glu Val Lya Val 
15 10 
(2) INFORMATION FOR SEQ ID NO: 80: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unlcnown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(V) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
Ala Gly Val He Asp Glu Asp Tyr Arg Gly Asn Val Gly Val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: untaown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
Thr Gly Leu He Asp Pro Gly Phe Gin Gly Glu Leu Lys Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE; DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GACGACGACA AGATGCTACT TCCAGACTGG AAA 33 
(2) INFORMATION FOR SEQ ID NO: 83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGAACAAGAC CCGTCCCACT TTCACAGATG AAGAG 35 
(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 85: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
CTCCATGTCC CAACTCCGAT CAC 23 
(2) INFORMATION FOR SEQ ID NO: 86: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GGTTTTCCCA GTCACGACGT TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGUUUUCCCA GUCACGACGU UGUAAAACGA CGGCCAGU 
(2) INFORMATION FOR SEQ ID NO: 88: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GACGACGACA AGATGCCCTG CTCTGAAGAG ACACC 
(2) INFORMATION FOR SEQ ID NO: 89: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: untaiown 
(ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
GGAACAAGAC CCGTTTAATT CTTTCCAGTG GAACC 
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What is claimed is: 

1 . A non-naturally occurring composition of matter comprising at least one 
component possessing nucleic acid polymerase enhancing activity selected from the 
group consisting of: an isolated or purified naturally-occurring polymerase enhancing 
protein obtained from a bacterial, eukaryotic. or archael source; a wholly or partially 
synthetic protein having the same amino acid sequence as said naturally-occurring protein 
or an analog thereof possessing polymerase enhancing activity; polymerase-enhandng 
mixtures of one or more of said naturally occurring or wholly or partially synthetic proteins; 
a polymerase-enhancing protein complex of one or more of said naturally occurring or 
wholly or partially synthetic proteins; and a polymerase enhancing partially purified cell 
extract containing one or more of said naturally occumng proteins. 

2. A composition of matter according to claim 1 , wherein said component 
possessing polymerase enhancing activity is a cell extract 

3. A composition of matter according to claim 2, wherein said cell extract is 
from an archael source. 

4. A composition of matter according to claim 3, wherein said cell extract is 
from Pyrococcus furiosus. 

5. A composition of matter according to claim 1 , wherein said component 
possessing polymerase enhancing activity is a protein complex. 

6. A composition of matter according to claim 5, wherein said protein complex 
is from an archael source. 

7. A composition of matter according to claim 6, wherein said protein complex 
is from Pyrococcus furiosus. 

8. A composition of matter according to claim 7, wherein said protein complex 
is P300. 

9. A composition of matter according to claim 5, wherein said protein complex 
comprises a plurality of subunits wherein at least one subunit has a molecular weight of 
approximately 17-18kD in the fully denatured, monomeric form. 

10. A composition of matter according to claim 9, wherein a subunit has a 
sequence of amino acids at the amino terminal end comprising SEQ ID NO: 46. 

11. A composition of matter according to claim 9, wherein a subunit has a 
sequence of amino acids comprising one of SEQ ID NO: 47 or 48. 
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12. A composition of matter according to claim 10 or 11, further comprising a 
subunit encoded by a DNA comprising the nucleotide sequence of SEQ ID NO: 18. a 
degenerate variant thereof, or a sequence hybridizable to the complement of SEQ ID 
N0:18. 

1 3. A composition of matter according to claim 1 , wherein said component 
possessing polymerase enhancing activity is a protein. 

14. A composition of matter according to claim 1 3, wherein said protein is from 
an archael source. 

15. A composition of matter according to claim 14. wherein said protein is from 
Pyrococcus furiosus, 

16. A composition of matter according to claim 1 5, wherein said protein 
comprises at least one protein having a molecular weight of approximately 17.1BkD in the 
fully denatured, monomeric fonm. 

17. A composition of matter according to claim 16, wherein said protein is 
selected from the group consisting of: a protein having a sequence of amino acids at the 
amino terminal end or within about 20 amino acids from the amino terminal end 
comprising one of SEQ ID NO: 69 or 1 1; a protein encoded by a nucleic acid comprising 
the sequence of SEQ ID NO: 70, a degenerate variant thereof, or a sequence hybridizable 
to the complement of the sequence of SEQ ID NO: 70; or a protein having a sequence of 
amino acids comprising SEQ ID NO: 71. 

18. A composition of matter according to claim 1 , wherein said component 
possessing polymerase enhancing activity is a wholly or partially synthetic protein 
comprising the same amino acid sequence as said naturally-occuning protein or an 
analog thereof. 

19. A composition of matter according to claim 18, wherein said protein has a 
molecular weight of approximately 17-18kD in the fully denatured, monomeric form. 

20. A composition of matter according to claim 19, wherein said protein 
comprises a sequence of amino acids at the amino temninal end or within 20 amino acids 
of the amino terminal end comprising one of SEQ ID NO: 1 1 or 69. 

21 . A composition of matter according to claim 19, wherein said protein has a 
sequence of amino acids comprising SEQ ID NO: 71. 

22. A composition of matter according to claim 20 or 21 , further comprising a 
subunit encoded by a DNA comprising the nudeotid sequence of SEQ ID NO: 70. 
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23. A composition of matter according to claim 1 . wherein said component 
possessing polymerase enhancing activity is a mixture of proteins. 

24. An isolated or purified DNA comprising a sequence encoding a protein 
according to the protein of one of claims 16 or 17. 

25. An isolated or purified DNA comprising a sequence selected from the group 
consisting of: the sequence set forth in SEQ ID NO: 18. a degenerate sequence thereof, 
or a DNA sequence hybridizable to the complement of SEQ ID NO: 18; the sequence set 
forth in SEQ ID NO: 70, a degenerate sequence thereof, or a DNA sequences 
hybridizable to the complement of SEQ ID NO: 70. 

26. An isolated or purified DNA sequence hybridizable to the complement of a 
DNA sequence according to claim 24 under stringent conditions. 

27. An isolated or purified DNA sequence hybridizable to the complement of a 
DNA sequence encoding a protein according to daim 20. 

28. A composition of matter comprising a polymerase-enhancing protein 
encoded by DNA according to claim 26. 

29. A composition of matter comprising a polymerase-enhancing protein 
encoded by DNA according to daim 27. 

30. A non-naturally occurring composition comprising a polymerase-enhandng 
composition according to daim 1 and one or more DNA polymerases. 

31 . A composition according to claim 30, wherein at least one of said 
polymerases is a thermostable DNA polymerase. 

32. A composition according to claim 30. wherein at least one of said 
polymerases is derived from an archael source. 

33. A composition according to claim 31 , wherein at least one of said 
polymerases is a DNA polymerase derived from the Pyrococcus species or the 
Thermococcus species. 

34. A mixture according to claim 31 , wherein at least one of said polymerases 
is Pyrococcus furiosus, Pyrococcus sp. JDF3, Pyrococcus sp. GBD, Pyrvcoccus sp. KOD, 
Thermococcus litoralis, or Pyrococcus woes/7 DNA polymerase. 

35. A kit for replicating nudeic acids comprising a polymerase-enhancing 
composition of claim 1 and at least one nudeic add polymerase. 

36. A kit according to daim 35, comprising at least one recombinant nucleic 
add polymerase, 
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37. A kit according to claim 35 or 36, which can be used in a site^irected 
nnutagenesis method. 

38. A kit according to claim 35 or 36, which can be used in a nucleic acid 
sequencing method. 

39. A kit according to claim 35 or 36, which can be used in an amplification 
reaction. 

40. A method of enhancing a nucleic acid polymerase reaction comprising, in 
any appropriate order, combining (a) a nucleic acid sequence template for a nucleic acid 
polymerase, (b) at least one nucleic acid polymerase, and (c) a polymerase enhancing 
composition according to claim 1 . 

41 . A method according to claim 40, wherein said reaction is a replication 
reaction. 

42. A method according to claim 40, wherein said reaction comprises an 
amplification reaction. 

43. A method according to claim 40. wherein said reaction comprises a PCR 
process or RT-PCR pnjcess. 

44. A method according to claim 41 , 42, or 43, further comprising a site- 
directed mutagenesis process, a cycle sequencing process, or a cloning process. 

45. An antibody that binds to a composition of matter of claim 1 . 

46. An antibody that binds to a protein having an amino acid sequence 
comprising one of SEQ ID NO: 19 or 71, 

47. A method for purifying a polymerase-enhancing protein comprising: 

(a) solubilizing the protein from archae cells while substantially 
maintaining protein:protein interactions; 

(b) performing heparin sepharose chromatography on the product of 
(a); 

(c) performing size exclusion chromatography on the product of (b); 
and 

(d) identifying a polymerase enhancing activity. 

48. A method for identifying the presence or absence of a composition of 
matter with polymerase enhancing activity, comprising adding a protein extract from cells 
to a nucleic acid polymerization reaction and measuring the number of products produced 
in said polymerization reaction compared to the number of products in a control reaction. 
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49. The method of claim 48. wherein the polymerase activity employed in said 
nucleic acid polymerization reaction comprises activity of at least one of native or cloned 
bacterial DNA polymerase, native or cloned archael DNA polymerase, native or cloned 
polymerase from the Pyococcus species or the Thennococcus species, Pyrococcus 
furiosus DNA polymerase, native or cloned reverse transcriptase, or native or cloned RNA 
polymerase. 

50. The method of claim 48. v^herein the polymerization reaction comprises 
one of a PGR process or RT-PCR process. 

51 . The method of claim 49, wherein the polymerization reaction comprises 
one of a PGR process or RT-PGR process. 

52. A method for identifying DNA encoding polymerase-enhancing activity in a 
sample comprising contacting a nucleic acid probe comprising all or a portion of the 
sequence of nucleotides from SEQ ID NO: 70 or a complement thereof, or comprising a 
sequence of nucleotides encoding all or a portion of the amino acid sequence of SEQ ID 
NO: 71 or a complement thereof, to nucleic acids of the sample and isolating a nucleic 
add capable of hybridizing to said probe. 

53. A method as claimed in claim 52, wherein a hybridization condition is 
employed comprising a low stringency virash in a solution comprising approximately .45 M 
NaCI. approximately .045 M trisodium citrate, and approximately .1% SDS, and wherein 
the wash temperature is approximately 37° to approximately 42'*C. 

54. A method as claimed in daim 53, wherein a hybridization buffer is 
employed comprising approximately .75M NaCI. approximately .075 M trisodium citrate, 
and approximately 50% formamide. and wherein a hybridization wash is employed 
comprising approximately .1 M phosphate, approximately .1 x SET. approximately .1% 
sodium pyrophosphate, and approximately .1% SDS at approximately 45"G. 

55. A method for identifying DNA encoding material having polymerase- 
enhandng activity in a sample comprising perfonning an amplification reaction with at 
least one primer capable of hybridizing to a DNA encoding material having polymerase- 
enhandng activity or a protein component thereof. 

56. A method as claimed in daim 55. wherein at least one primer comprises an 
at least 15 nudeotide portion of the DNA of SEQ ID NO: 70, or a DNA comprising a 
sequence encoding an at least 5 amino add portion of SEQ ID NO: 71 , or a DNA 
comprising the DNA of one of daims 25-27. 
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57. A method for identifying polymerase-enhancing activity in a sample 
comprising contacting an antibody of claim 45 or 46 with said sample and detecting 
protein binding to said antibody. 

58. A DNA construct comprising a sequence encoding PEF protein P45 
operably linked to an expression vector. 

59. A P45 protein produced from a cell containing a DNA construct as claimed 
in claim 58, wherein the protein is in monomeric. dimeric. or multimeric form. 

60. A P45 protein as claimed in claim 59. wherein the cell is a bacterial cell. 

61. A PEF complex comprising a P45 protein as claimed in daim 59. 

62. An antibody that binds to a P45 protein as claimed in claim 59, 

63. An antibody that binds toaPEF complex as claimed in claim 6 1 . 

64. A P45 protein produced from a cell containing a DNA construct as claimed 
in claim 58, wherein the P45 protein is produced as a fusion protein. 

65. A P45 protein as claimed in claim 64, wherein the fusion protein comprises 
a calmodulin binding peptide. 

66. A P45 protein as claimed in claim 65, wherein the expression vector is 
pCAL-n-EK. 

67. A kit for replicating nucleic acids comprising at least one polymerase, a P45 
protein as claimed in daim 59, and reagents for performing a polymerization reaction. 

68. A kit as claimed in daim 67, wherein the P45 protein is present in a PEF 
complex. 

69. A method of enhandng a nucleic acid polymerase reaction comprising 
adding a P45 protein as claimed in daim 59 to a polymerization reaction. 

70. A method of enhandng a nudeic acid polymerase reaction as daimed in 
claim 69, wherein the P45 protein is present in a PEF complex. 

71 . A nrjethod of enhandng a nucleic acid polymerase readion comprising 
performing the reaction in the presence of one or more of the following: a PEF; dUTPase 
activity; a protein that turns-over dUTP; a protein having one or more of SEQ ID NO: 72- 
81. 

72. A method as daimed as daimed in claim 71 , wherein the dUTPase activity 
comprises one or more of a P45 protein produced from a cell containing a DNA construct 
comprising a sequence encoding PEF protein P45 operably linked to an expression vedor, 
wherein the protein is in monomeric. dimeric, or multimeric form, a human dUTPas , a 
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bacterial dUTPase, an archael dUTPase, a yeast dUTPase, a mammalian dUTPase. or an 
animal dUTPase. 

73. A method for controlling the activity of a polymerase in a polymerization 
reaction, comprising changing the amount of dUTP present or generated during the 
reaction by adding a material having PEF activity. 

74. A method as claimed in claim 73. wherein the PEF activity comprises a P45 
protein produced from a cell containing a DNA construct comprising a sequence encoding 
PEF protein P45 operably linked to an expression vector, wherein the protein is in 
monomeric. dimeric, or multimeric form. 

75. A method for detecting the presence or absence of PEF activity in a sample 
comprising adding the sample to a nucleic acid polymerase reaction containing dUTP or 
dCTP and monitoring the inorganic pyrophosphate levels. 

76. A method for detecting the presence or absence of PEF activity in a sample 
comprising adding the sample to a nucleic acid polymerase reaction containing dUTP and 
monitoring any change in polymerization levels. 

77. A non-naturally occuning composition of matter comprising a P45 protein. 

78. A composition of matter as claimed in claim 77, wherein the P45 protein is 
in monomeric, dimeric, or multimeric fomi. 

79. A composition of matter as claimed in claim 77, wherein the P45 protein is 
present in a protein complex. 

80. A composition of matter as claimed in claim 77, wherein the P45 protein is 
an analog P45 protein. 

81 . A method of producing a P45 protein comprising transferring a DNA 
construct as claimed in claim 58 into a host cell and expressing the P45 protein. 

82. A method as claimed in claim 81 . wherein the P45 protein is expressed as 
a fusion protein. 

83. A method of producing a PEF analog protein comprising introducing at 
least one mutation into the sequence encoding P45 protein of the DNA construct of claim 
58 or into a sequence encoding a dUTPase protein, transfening the sequence to a host 
cell, and expressing the PEF analog protein. 

84. A DNA encoding a PEF activity comprising (a) one or more of SEQ ID NO.: 
32-35, 82. 83, or 70. (b) a sequence hybridizable to the complement of one or more those 
sequences under stringent conditions, or (c) a deg nerate variant of (a) or (b). 
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es. A protein having PEF activity comprising one or more of SEQ ID NO.: 72- 

81. 

86. A nfiethod for cloning a PEF activity comprising employing the complement 
of one or more nucleic adds comprising one or more of SEQ ID NO: 32-35. 82. 83. or the 
complement of all or a portion of SEQ ID NO: 70. and identifying a clone containing a 
sequence that hybridizes to the one or more nucleic acids. 

87. A PGR enhancing, protein extract comprising purified proteins from 
Thermus thermophilic that possesses dUTPase activity. 

88. A protein extract of claim 87, which comprises a protein that can be bound 
by an antibody specific for recombinant Pfu P45 protein. 

89. A protein extract of claim 88, wherein the protein possesses a molecular 
weight of approximately 24kD in an SDS-PAGE gel. 

90. A protein extract of claim 87. which comprises a protein that possesses a 
molecular weight of approximately 24kD in an SDS-PAGE gel. 

91 . A composition comprising a protein extract as claimed in claim 87. 

92. A composition comprising a protein extract as claimed in claim 87. and 
further comprising a themnostable DNA polymerase. 

93. A composition comprising a protein extract of claim 88 and a themiostable 
DNA polymerase. 

94. A computer readable medium having stored in it the full or partial amino 
acid or DNA sequence information of a PEF protein. 

95. A computer-based method of screening for a PEF. comprising providing a 
computer readable medium as claimed in claim 94, and identifying other sequences in a 
database that possess sequence homology, similarity, or identity to all or a portion of the 
sequence stored in the computer readable medium. 
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Figure 5 
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Figure 7 
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Figure 9 
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Figure 10 
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Figure 11 
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Figure 12 
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